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Chapter 1 

Introduction and preliminaries 



Most problems in engineering and applied mathematics are motivated by some real 
world situation where one wants to make quantitative statements about the behavior 
and evolution of some system. Examples are mechanical engines, chemical processes, 
weather systems, or financial markets. These systems are rarely completely deterministic 
(in the sense that they can be described exactly by a deterministic model), but exhibit 
some randomness in their behavior. Sometimes these effects are negligible, but often 
it is necessary to take them into account. In some applications the influence of the 
random effects dominate over the deterministic structure, and a model that accurately 
captures the stochastic properties of the system is indispensable. If in such a situation 
the randomness is induced by different sources, or if several quantities are influenced 
differently by one source, a central issue is how to describe the dependence between 
these quantities; that the system behaves randomly does not mean that all components 
are independent. 

Dependence between random variables can be very complicated, and not only in high 
dimensions. For example, an important but difficult issue is to find a relevant measure 
of the strength of dependence. It is easy to find examples where a very common such 
measure, the correlation coefficient, does not give any information at aljj. 

Different approaches have been taken to study the concept of dependence, and a variety 
of results exist in probability theory. In this thesis we shall focus on so-called copulas, 
in particular a certain class of copulas called Archimedean copulas. A copula is an 
object that allows to isolate the dependence structure from the marginal behavior of 
a random vector, which probably is the main reason why the concept is theoretically 
useful. It also turns out that arbitrary marginal distributions can be combined to a 
multivariate joint distribution using any specified copula. This makes copulas a useful 

Let X and Y be independent random variables, suppose E[F] — 0, and set Z := XY. Then 
Cov[X, Z] = E[XZ] -E[X]E[Z] = E[X 2 ]E[F] -E 2 [X]E[F] by independence. The covariance is thus zero, 
but X and Z are clearly far from independent. See Embrechts et al. [9] for more on the intricacies of 
the correlation coefficient. 
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tool in practical applications, where it can be relatively easy to model the marginal 
distributions, but much harder to get an idea of the dependence. Using copulas, these 
two problems can be separated. Before continuing, we take the opportunity to mention 
that the copula approach to multivariate modeling is not uncontroversial. See Mikosch 
|21j for a discussion of the drawbacks. 

What is then a copula? Formally it is defined as follows: 

Definition 1.1 A d-dimensional copula, generically denoted C,is a d- dimensional dis- 
tribution function with standard uniform marginal distribution^. 

It is not clear from the definition why copulas are useful for studying dependence. The 
following basic result due to Sklar [26] links the definition of copulas to their role in 
dependence modeling. We omit the proof, which is fairly simple in the case when 
the marginal distribution functions are continuous, and more involved when they are 
not. 

Theorem 1.2 Let H be a d-dimensional distribution function and F\, . . . ,F d its one- 
dimensional marginals. Then there exists a copula C such that 

H(x 1 ,...,x d ) = C(F 1 (xi),...,F d (x d )). 

C is uniquely determined on Ran F± X . . . X Ran F d , where Ran Fi is the range of F^. 
Conversely, given a copula C and univariate distribution functions Fx, ... , F d , the above 
relationship always defines a d-dimensional distribution function H. 

Sklar's theorem clarifies the role of the copula C as carrier of the dependence structure, 
fully separated from the marginals. A result similar to Theorem 11.21 holds for survival 
functions. Although completely analogous we state it explicitly, since in our context 
survival functions will occur frequently. 

Theorem 1.3 Let H be a d-dimensional survival function and Fi,...,F d the corre- 
sponding one- dimensional marginal survival functions. Then there exists a copula C 
such that 

H(x 1 ,...,x d ) = C(F 1 (x 1 ),...,F d (x d )). 

The copula is uniquely determined on Ran Fi x . . . x Ran F d . Conversely, given a copula 
C and univariate survival functions Fi,...,F d , the above relationship always defines a 
d-dimensional survival function H. 

The object C in Theorem 11.31 is called the survival copula of H (or H). It is easy to 
show that if C\ is the (ordinary) copula of H, and C2 is its survival copula, then 

(Ux,...U d ) ~Ci if and only if (1 - U 1} . . . , 1 - U d ) ~ C 2 , 

where as usual the symbol ~ stands for "is distributed according to" . It is important to 
be clear about the distinction between a copula and the corresponding survival copula. 

2 Note that the condition on the marginals implies that the support of any d-dimensional copula is 
contained in the unit cube [0, l] d . 
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This sometimes becomes confusing, especially when the tails of a distribution are related 
to the tails of the copula (or survival copula). For example, the dependence in the upper 
tail of a distribution H is captured by the upper tail of its (ordinary) copula, but by the 
lower tail of its survival copula. 

We note that Sklar's Theorem implies that given a joint distribution function H and its 
marginals Fx, . . . , F4, the copula can be expressed as 

C{u u ...,u d ) = H{F^( Ul ), . . . , F^(u d )). (1.1) 

Here Ff~ is the generalized inverse of Fi, defined as 

Ff-(u) := mf{x G R\Fi(x) >u}, u G [0, 1]. 

Not all distribution functions F are invertible, but since they are non-decreasing the 
generalized inverse is always well-defined. The generalized inverse of the survival function 
Fi is defined as F£~(u) := F^(l — u) for u G [0,1]. Similarly as in (jl.ip . the survival 
copula can be extracted from H and Fx, . . . , F^. 

As already mentioned, this thesis is devoted to the class of Archimedean copulas. A lot 
of research has been done in this field, both concerning the general theory of copulas 
and Archimedean copulas in particular. Nelsen [22] contains a rigorous mathematical 
introduction, and McNeil, Frey and Embrechts [20J gives a good account of how to 
apply copulas in (financial) risk management. In this thesis we will exploit some recent 
advances in the understanding of the structure of such copulas. We give an introduction 
to Archimedean copulas and derive the necessary results and tools. This is carried out 
in Chapter [2j In particular our aim is to emphasize how these copulas arise as the 
dependence structures of certain mixture models. 

The subsequent analysis will focus on the extremal behavior of Archimedean copulas. 
More precisely, we are interested in questions like how to describe the dependence be- 
tween the largest observations (in a suitable sense) of a multivariate sample, or the de- 
pendence between observations exceeding some high threshold. Another issue is to what 
extent large outcomes occur simultaneously. All these concepts will be made precise as 
they are investigated. Some standard tools for dealing with such issues are introduced in 
Chapter El where we give a brief account of extreme value theory (EVT) and the theory 
of regularly varying functions. 

Chapter [H which contains the main theoretical contribution of this thesis, concerns 
the asymptotic behavior of a certain integral transform that proves to be of central 
importance in the theory of Archimedean copulas. The results in this section can, and 
should, also be interpreted as properties of the tails of certain Beta mixtures. 

The last part of the thesis, Chapters [5H3 have the character of exercises in extreme 
value theory. It is shown how the results and concepts developed in Chapters [2HH 
make it straightforward to determine extremal properties of Archimedean copulas, such 
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as coefficients of tail dependence, extreme value limits and limiting threshold copulas 
(notions that will be given precise definitions later on). 

It is worth noting that we only are concerned with distributional properties of the random 
elements under considerations. We therefore only remark in passing that one always can 
find a probability space (O, J 7 , P) where all random elements live. This is a consequence 
of Skorokhod's Representation Theorem. That said, there will be no further mention of 
probability spaces. 

Now some remarks on notation and terminology. Capital letters (X, Y, U, . . . ) denote 
random variables. Vectors are written in boldface — so, for instance, X = (X\, . . . , X^) 
is a random vector taking values in M. d , and x = (x\, . . . ,Xd) is a d-dimensional deter- 
ministic vector. The letter d will denote dimension, and is always an integer greater than 
or equal to two. Equalities, inequalities and arithmetic operations are to be interpreted 
componentwise. For example, x > y means ar, > yi for all i. 

With a slight abuse of language we will say that H is a survival function on the set 
A C M d , when we mean that the corresponding random vector takes its values in A with 
probability one. In particular, saying that H is a survival function on R'J. := {x 6 M. d \ 
x > 0} means that a random vector X ~ H satisfies X £ PJj_ a.s. Saying that H is 
a survival function on the interior of means the same thing, except that we now 
exclude probability mass on the boundary of Note that such statements about H 
do not mean that the function x i— > H(x) has its support confined to IR^ or its interior. 
They do, however, imply that H is completely determined by its values in this set. 

Several results, particularly in the last chapters, will concern weak convergence of random 
variables, denoted X n =4> X as n — > oo. This means that the corresponding distribu- 
tion functions F n of X n converge weakly to the distribution function F oi X, i.e. that 
liim^oo F n (x) = F(x) for all x where the limit F is continuous. Weak convergence of 
random vectors, X n =^ X, is defined analogously. 

Finally, since distributions of mixtures, i.e. products of two random variables, will be 
a central theme, conditioning arguments are crucial. This can be quite a delicate issue 
(see e.g. Proschan and Presnel [23] for good illustrations of some of the pitfalls), which 
we nevertheless can circumvent using the following result: 

Lemma 1.4 Suppose that the M n -valued random variable X and the W 71 -valued random 
variable Y are independent, and let h : M n x M m — > M be a measurable function such 
that f(x) := E[/i(a;,"K)] is well-defined for all x G M n . Then 

E[h(X,Y)\X]=f(X). 

In particular, if X is scalar with distribution function F, then 

¥[XY > z] = [ F[Y > t- 1 z]dF{t). 
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Proof. The first part is Corollary 4.38 in Breiman [3]. For the last part, let h(x,y) := 
^{y>x-^z} and note that P[XY > z] = E[¥[Y > X^z\X]\ = E[E[h{X,Y)\X}}. Since 
f(x) = P[y > x _1 z], the result follows from the first part of the lemma. □ 
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Chapter 2 

Archimedean copulas 



Archimedean copulas originally emerged as a family of copulas characterized by cer- 
tain algebraic properties. This chapter is devoted to investigating their structure and 
characteristics. Section [2.11 defines Archimedean copulas and the so-called Archimedean 
generators. Some important examples are given. We then continue with some theoretical 
background on al-monotone functions (Section I2.2|) and l\-norm symmetric distributions 
(Section [22]). This theory is necessary to prove (and understand) the main result, Theo- 
rem [2J21 which is given in Section \2Al This result shows that Archimedean copulas can 
be characterized as the dependence structure of the very well-behaved family of ^i-norm 
symmetric distributions. It also characterizes the relevant class of Archimedean genera- 
tors. Further we discuss interpretations of the Archimedean dependence structure. Sec- 
tion 12.51 concludes with a result that clarifies the correspondence between Archimedean 
copulas, their generators and ^i-norm symmetric distributions. The theory presented in 
this chapter (except Section I2.5P is based on McNeil and Neslehova [19] . 

2.1 Definition and examples 

Definition 2.1 (Archimedean copula and Archimedean generator) A d-dimensional cop- 
ula C is called Archimedean if it has the representation 

C(u) = + • • • + i>~ l (u d % u € R d + , 

where tp : R+ — > R+ is an Archimedean generator, i.e. continuous, strictly decreasing 
on {ip > 0} and satisfying ip(0) = 1 and lim x _ ) . 0O ip{x) = 0. Here if) -1 is defined as 
il)~ l {u) := inf{x > 0|t/>(x) = u}. 

If ^~ 1 (0) = oo the generator is said to be strict, otherwise non-strict. 

The function is sometimes called the pseudo-inverse of ip. Due to monotonicity and 
continuity of ip, it coincides with the usual inverse of the restriction of ip to {ip > 0}. If 
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ip is strict, this set is the whole of R+. Non-strict generators are not invertible, but ip 
always is a right inverse of ifi, i.e. ifi o ip^iu) = u for all u £ [0, 1]. 

Many authors define Archimedean copulas in terms of <fi = ip" 1 , and refer to this function 
as the generator. Although at first sight it is just a matter of convention, our definition 
turns out to be much more natural for studying the structure of these copulas. In this 
thesis, rp is thus the generator, and cj) = ip~ l is referred to as the inverse generator. 

An important observation is that ip defines a survival function on R + . This will be used 
later on. 

Example 2.2 The copulas H(u) := u\U2 ■ ■ ■ Ud (the independence copula,) and W(ui, U2) '■= 
(u\ +U2 — 1)+ (the countermonotonicity copula,) are both Archimedean, with generators 
ipu(x) = e~ x and tpw(x) = (1 — x )+> respectively. 

The copulas Cg" l (u) generated by tpg(x) = (1 + Ox)^ 1 / 6 , > —1 (the Clayton copulaj, 
and Cf u {u) generated by ^f u (x) = exp{— x 1 / 8 }, 9 > 1 (the Gumbel copula,), are two 
examples of parametric families of Archimedean copulas. They are among the most 
commonly encountered, and turn out to be particularly important in our context. 

It is easy to check that the functions ipn, V'W) ^0 an d tp^ n are all Archimedean gener- 
ators. As for W (in the bivariate case) and II (in any dimension), we even know that 
these functions are copulag^. But we have not checked that the parametric families of 
claimed copulas are indeed copulas in the sense of Definition 11.11 To do this we need 
conditions for when an Archimedean generator ip actually generates an Archimedean 
copula in a given dimension. In Section 12.41 we will characterize these generators and 
derive a very useful stochastic representation for Archimedean copulas. First, however, 
we need some background. 



2.2 d-monotonicity and the Williamson d-transform 

In the paper [27], R. E. Williamson studies the properties of real- valued functions having 
the following regularity property: 

Definition 2.3 (d-monotonicity) A function f : [0, 00) — > R is called <i-monotone for 

d>2 if it is d — 2 times differentiable in (0, 00) and the derivatives satisfy 

(-l) k fW(x)>0, fc = 0,l,...,d-2, 

and (— i)d-2y(d-2) 2 S non-increasing and convex in (0, 00). It is said to be completely 
monotone if the above holds for all d > 2. 

x The marginals are clearly standard uniform. II is a product of univariate distribution functions and 
hence itself a distribution function (with independent marginals). For W, observe that W(ui,U2) = 
f[U < ui, 1 — U < Ua] for U standard uniform. 
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Note that although /( rf_2 ) not necessarily has a derivative, it is convex and hence ab- 
solutely continuous on (0, oo). For ease of notation, we will write f^ d ~^ for the right- 
continuous version of the density of f( d ~ 2 \ 

Williamson |27] gives a representation theorem for ci-monotone functions in terms of a 
certain integral transform, whose basic properties he also derives. These results were 
cast into the form presented here by McNeil and Neslehova [19J. 



Definition 2.4 (Williamson d-transform) Let X be a non-negative random variable 
with distribution function F and d > 2 an integer. The Williamson (i-transform of 
X (or F) is a real function on [0, oo) given by 



aww = jf(i-f 




Proposition 2.5 (Properties of the Williamson d-transform) Let F be a distribution 
function on R + . The Williamson d-transform f = WdF satisfies the following properties: 

(i) (Right continuity) f(x) — > /(0) = 1 — F(0) as x 10. 

(ii) ( Order of decay) x k f^\x) — > as x — >■ oo for k = 0, 1, . . . , d — 1. 
(Hi) (Lnvertibility) The Williamson d-transform is invertible, and 

d-i , 

k\ 



f{x) = w~ d l f{x) = i - Y, { -^rx k f ik) (x), ^ > o. 



k=0 

Proof. Part (i) follows from the Dominated Convergence Theorem upon noting 
that 

f X \d—l / x \ d— 1 



^ 1- ^J = \ 1 ~XJ 1 {x>x} ^ l{x>o} a.s., a; 4.0. 

Part (ii) is shown by Williamson [27], Lemma 1 (ii), and part (Hi) is Proposition 5 (ii) 
in McNeil and Neslehova \M- □ 



The following result is the main reason for studying the Williamson d-transform in 
connection with Archimedean copulas. Its relevance will be apparent later on. 

Theorem 2.6 (Representation theorem) Let f be a real-valued function on M + . Then 
f is d-monotone with f(0) = p € [0, 1] and lini;^^ f(x) = if and only if f = WdF for 
some distribution function F on M+. 

Proof. The proof can be found in McNeil and Neslehova [19], Proposition 5 (i). 
It is essentially a reformulation of Williamson's original proof in terms of distribution 
functions. □ 



In Williamson's paper, the d-transform is viewed in the context of real analysis as a map- 
ping on a set of non-decreasing functions on the positive half-line. With the formulation 
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here we view it in a probabilistic sense as a mapping on the set of distributions supported 
on the positive half-line. Using Lemma 11.41 (for the second equality below), this proba- 
bilistic interpretation can be taken further by noting that, for G(x) = (1 — x)^ 1 , 

roc 

W d F(x) = / G(x/t)dF(t) = E[¥[X > x/Y\Y]] = F[XY > x], 
Jo 

where X and Y are independent, X ~ F, and Y ~ G and hence follows a Beta(l, d — 1) 
distribution. The Williamson d-transform thus describes the relationship between non- 
negative random variables X and certain Beta mixtures XY. 

We conclude this section with a result that is the key to analyzing the structure of 
Archimedean copulas. 

Proposition 2.7 Let f be a real-valued function on [0, oo) and H given by 

H{x) = f( Xl + --- + x d ), xeR d + . 

Then H is a survival function on the interior of Mi if and only if f is d-monotone and 
satisfies f(0) = 1 and linr^oo f(x) = 0. 

Proof. See Proposition 2 in McNeil and Neslehova |19| . The proof consists in showing 
that d-monotonicity is a necessary and sufficient condition for the (possibly signed) 
measure induced by H to put non-negative mass on so-called rectangle sets, i.e. sets of 
the form (a\,b\\ x . . . x (ad,bd\ C Mi. Together with conditions on H and / at the 
origin and infinity, this is equivalent to H inducing a probability measure. Note also 
that in our terminology, H being a survival function on the interior of excludes it 
from assigning mass to the boundary of this set. □ 



2.3 4-norm symmetric distributions 

Although the relatively simple algebraic form (see Definition 12. 1[) is one of the reasons 
why Archimedean copulas are popular in some situations, this form does not help to 
interpret the corresponding dependence structure. Moreover, in many cases computa- 
tions still become involved and seldom lead to any deeper understanding of why results 
hold true. This is in particular true in high dimensions, where calculations become 
cumbersome. Instead of working directly with the copulas, we would therefore prefer, if 
possible, to work with some suitable class of distributions having a simple structure. The 
family of l\-norm symmetric distributions introduced in this section turns out to be an 
appropriate class. In what follows, S d will denote the d-dimensional unit simplex: 

d 

S d := {x € | ||x[|i = 1}, where = ^ 
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Definition 2.8 A random vector X onWt is called £i-norm symmetric if there exists a 
non-negative random variable R and a random vector S, independent of R and uniformly 
distributed on the d-dimensional simplex S d , such that 

X = RS. 

The random variable R is referred to as the radial part of X and its distribution as the 
radial distribution. 

The stochastic representation in terms of a certain simplex mixture makes ^i-norm sym- 
metric distributions very tractable objects. In the following we restrict attention to the 
case where R places no mass at the origin, i.e. R > a.s. 

Proposition 2.9 Suppose X = RS is i\-norm symmetric with R > a.s. Then 

(i) R = ||X||i and S = 

(ii) ||-X"||i and X/||-X"||i are independent. 

Proof. The proof relies on the following fact: if / : R n — > R m is a measurable map 
and Y = Z are identically distributed R n -valued random variables, then f(Y) = f(Z), 
see P3], p. 13. Letting / : y ^ {\\y\\i, y/\\y\\i) for y G R d \{0} we get 

(\\X\\i, = f(X) 4 f(RS) = (R, S), 

using that ||X||i > a.s. Now (i) clearly follows, and since we applied the result to the 
joint distribution of (R, S), we also get (ii). □ 



The following characterization of £i-norm symmetric distributions without point mass at 
the origin will be crucial in the sequel. Note that the proof of the direct half, (i) implies 
(ii), uses results on ci-monotone functions. This fundamental connection between d- 
monotonicity and i'l-norm symmetric distributions is also evident when comparing the 
next result with Proposition 12.71 

Proposition 2.10 The following are equivalent: 

(i) H(x) = f(x\ + . . . + Xd), x G is a survival function on the interior o/R^. 

(ii) H is the survival function of the t\-norm symmetric random vector X = RS, 
R>0 a.s., and f(x) = W[RSi > x] = F[Xi > x}. 



Proof. (ii) 
have 

PLY > x] 



(i): Let F be the distribution function of R. By Lemma 11.41 we 
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dF(t), 



x G 
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and similarly 



f(x) = FIRS! > x] 



Si > 



x 



dF(t), 



x > 0. 
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It is well-known (see e.g. Fang et al. [IT], Theorem 5.2 (i)) that the joint survival function 
of S ~ unif(5 d ) is given by F[S > x] = (1 - ||as || i)^ -1 , a: G Rf . Hence P[5i > x] = 
(1 — x > 0, and we get 

F[S > x] =F[Si > ||x||i], z;GR+. 

This finally implies that 



\X > x] 



Si > 



\x\\i 
t 



dF(t) = f(\\x\\ 1 ), 



x G 



(i) ==> (ii): Since (i) holds by assumption, / has to be d-monotone by Proposition 12.71 
The representation theorem for d-monotone functions, Theorem l2.6t then gives a random 
variable R with distribution function F such that / = W d F. Due to the invertibility 
of the Williamson d-transform (Proposition 12. 5|) . F is uniquely determined by /, and 
by the discussion after the proof of Theorem 12.61 W d F(x) = ¥[RS\ > x] since we know 
that Si ~ Beta(l, d — 1). Finally, take S uniform on S d and independent of R, and 
define X := RS. As in the proof of the first part of the theorem it follows that H is the 
survival function of X . □ 



2.4 Archimedean copulas: characterization and stochastic 
representation 

After the excursions in the two previous sections we now return to the Archimedean 
copulas and the problem of their characterization and representation. We will see how 
the theory of d-monotone functions and ^i-norm symmetric distributions enters the 
picture. The connection is actually little more than a simple observation; we use Sklar's 
theorem together with the fact that an Archimedean generator ip is a survival function 
on R+. 

Suppose C is an Archimedean copula generated by ip. If ip is strict, the equality 

H(x) := C{4>(xi), iP(x d )) = ip(xi + ...+ x d ), x€R d + 

certainly holds. If ip is non-strict, the equality holds if xi < for all i, since in 

this case o -ip(xi) = X{. If on the other hand Xi > V> -1 (0) for some i, the right side 
is clearly zero, as is the left side since ip(xi) = and C is a distribution function with 
uniform marginals (hence no mass on the boundary of [0, l] d ). So the equality holds 
also in this case. Sklar's theorem now implies that H is a joint survival function on 
Ri, and even on the interior of R^_ since ^(0) = 1 by the definition of Archimedean 
generators. 

Conversely, if a d-dimensional survival function H on the interior of R^ has the form 

H(x) = ifj(xi + ... + x d ), x G R+ 
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for some function ip, the marginal survival functions are clearly ip(x), and Sklar's theorem 
gives the form of its survival copula C as 

cw = ^(f(«i) + ... + fW). 

If in addition ip is an Archimedean generator we have = tp -1 , and C is Archimedean 
by definition. We formulate these conclusions as 

Lemma 2.11 The function C : [0, l] d — > [0, 1] given by 

C(u)=^(ip- 1 (u 1 ) + ... + iP'\u d )), u€[0,l} d (2.1) 

is a d- dimensional Archimedean copula if and only if 

H( X ) = i/j(x 1 + ... + x d ) = iP(\\x\\i), xeR d + (2.2) 

is a joint survival function on the interior of R^ . In this case, C is the survival copula 
of 3. 

Proof. That C is the survival copula of 3 is immediate from the discussion above. 
The equivalence assertion also follows as soon as we prove that the assumption (|2.2|) on 
3 implies that ip is an Archimedean generator. Clearly ip(x) — > as x — > oo since 3 
is a survival function. Moreover, ip(0) = 1 since 3 is a survival function on the interior 
of Ri. To deduce continuity and strict monotonicity we must invoke Proposition 12. 7| 
which says that ip is <i-monotone. In particular, ip is convex and hence continuous on 
(0, oo), and since ()2.2p implies that ip is a marginal survival function, it must be right 
continuous everywhere which excludes a jump at 0. Finally, since ip(x) — > as x — > oo, 
convexity implies that ip is strictly decreasing on the set where it is positive. □ 

It is clear from Lemma [2.111 that if ip generates a d-dimensional copula, it also generates 
/c-dimensional copulas for all k < d. Indeed, ip(xi + . . .+Xj~) is the fc-dimensional marginal 
of 3{x) = ip(xi + . . . + Xd), and we may apply Lemma f2. Ill to this function. 

We are now in a position to put the pieces together and give the main result that describes 
the structure of the Archimedean copulas. It is clear that the common denominator is 
the class of distributions whose survival functions can be written as a function of the 
^i-norm of the argument. Due to Proposition 12. 10|. these are precisely the ^i-norm 
symmetric distributions without a point mass at the origin. 

Theorem 2.12 (Main result) The following are equivalent: 

(i) C(u) = ijj{ip~ l {ui) + . . . + ip^iud)) is a d-dimensional Archimedean copula. 

(ii) ip is a d-monotone Archimedean generator. 

(Hi) S(x) = tp(xi + . . . + Xd), x G is a survival function on the interior o/Ri. 

(iv) 3 is the survival function of the t\-norm symmetric random vector X = RS, 
R>0 a.s., and 4>{x) = F[RSi > x] = F[X X > x}. 
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If (i) — (iv) hold, C is the survival copula corresponding to the survival function H , and 
ip = 2Ud-F, where F is the distribution function of R. 

Proof. The equivalence (i) -<=^ (Hi) as well as the assertion that C is the survival 
copula of H follow from Lemma 12.111 Moreover, (ii) <^=^> (Hi) and (Hi) <^=^ (iv) follow 
from Proposition 12 . 71 and Proposition 12.10} respectively. Finally, since S\ is Beta(l, d— 1) 
distributed, ip = ¥[RS\ > x] implies that tp = %B d F as in the discussion after the proof 
of Theorem 12.61 □ 



Corollary 2.13 Let U be distributed according to the d- dimensional Archimedean cop- 
ula C with generator Then X := (ip^iUi), . . . ,il)~ l (U d )) is l\-norm symmetric with 
radial part R ~ F satisfying R > a.s. and ip = W d F. 

Proof. The survival function of H of X := (ip~ l (Ui), . . . , V ;_1 (^ r d)) is 

H(x) = PhTHt/i) > xi, . . .,i>-\U d ) > x d ] = Fp! < ...,Ud< ^(x d )}, 

since ip is decreasing. U is distributed according to C, so 

H(x) = ^~ 1 (rP(x 1 )) + ... + ip-H^))) = VfllajHi), x € R d + , 

where the last equality follows as in the proof of Lemma 12.111 The assertion now follows 
from Theorem 12.121 □ 



Part (ii) of Theorem 12.121 gives a necessary and sufficient condition that can be used to 
check whether a given Archimedean generator indeed generates a d-dimensional copula. 
One can for instance show that this is the case for the Gumbel generator ijj^ u (x) in 
Example 12.21 if 9 > 1. The Clayton generator rpg(x) turns out to be (i-monotone for 
9 > —l/(d— 1), see McNeil and Neslehova [19) . Example 4. Part (iv) gives a stochastic 
representation that greatly simplifies computations and can be used for simulation pur- 
poses. It is also the key to important interpretations of the Archimedean dependence 
structures, which is essential when justifying the use of these copulas in practice. 

1. We know that an £i-norm symmetric random variable X = RS has the properties 
that || JSC || ! = R and X/||X||i = S. Hence it can be viewed as an allocation of 
a random total amount it! of some resource among d units, such that Si is the 
fraction allocated to unit i. 

2. Moreover, ||-X"||i and X/||X||i are independent, meaning that the individual frac- 
tions do not depend on the total allocated amount. 

3. Finally, requiring uniformity of S on the unit simplex S d amounts to saying that 
all possible combinations of allocated fractions are equally likely to occur. This 
clearly imposes a strong symmetry on the distribution. 

Archimedean copulas are the natural dependence structures of such "fair-share" distri- 
butions. One application from finance is in credit risk, where large portfolios of credit 



15 



deals (e.g. loans) are commonplace. In many cases such portfolios, or sub-portfolios, 
possess a large degree of homogeneity, which is consistent with point 3. Viewing the 
total loss as the "resource" (in credit risk it is common practice only to consider losses; 
R will thus be positive) and assuming that the size of the loss does not influence its 
distribution between clients, we have 1. and 2. The latter is reasonable only if the 
client exposures are of comparable orders of magnitude; knowing that the total loss was 
large, one could otherwise deduce that this large loss probably was caused by defaults 
among high-exposure clients rather than low-exposure clients. This would contradict 
independence between ||-X"||i and Jf/||X||i. 

2.5 A supplementary result 

Due to the invertibility of the Williamson d-transform, Theorem 12.121 implies that 
there is a one-to-one correspondence between a.s. strictly positive random variables, 
or, equivalently, ^i-norm symmetric distributions without a point mass at the ori- 
gin, and d- monotone Archimedean generators. The correspondence between the gen- 
erators and the Archimedean copulas is, however, not one-to-one. Indeed, ip{x) and 
ip(x) := tp(cx), c > 0, give rise to the same copula. The corresponding radial distribu- 
tions are R = 2Xr^ 1 ?/> and R := R/c. The next result shows that for a fixed copula C, 
the possible generators (or radial parts) cannot be more different than that. 

Proposition 2.14 (i) Two Archimedean generators ip and ip generate the same cop- 
ula if and only ifip(cx) = ip{x) for some c > 0. 

(ii) Let X and X be l\-norm symmetrically distributed. They have identical survival 
copulas if and only if their radial parts R and R satisfy R = R/c for some c > 0. 

Proof. Due to the one-to-one correspondence between the generators and radial parts 
as discussed above, it suffices to prove one of the two statements. We give a proof of 
(i). Sufficiency follows immediately. For necessity, suppose ip and ip generate the same 
copula. Then 

^-\ Ul ) + ■■■+ ^\u d )) = + • • • + Ui G [0, 1]. 

Now define T := ip" 1 o ip. Since the pseudo-inverse always is a right inverse, we have 
T o ip" 1 = ip~ l and ip o T = ip. The above equality is thus equivalent to 

i{>(T o ^{m) + ■ ■ ■ +T o tp-\u d )) = ^o TfoT^ui) + • • • + ip~ l {u d )), 

or, with Xi := ip~ l {ui) G [0, x*], where x* := V' _1 (0), 

^{T(x x ) + ---+T{x d )) = $oT{ Xl + ---+x d ) : xi€[0,x*]. (2.3) 

We would now like to apply ip~ l on both sides of ()2.3p to get a functional equation for 
T, but ip is only invertible on {ip > 0}. Now, if x% + • • • + x d < x* then T{x\ + - ■ ■ + x d ) < 



16 



V' _1 (0), implying that the right side of (|2.3|) is strictly positive. The left side is then also 
strictly positive, and since x\ + • • • + x d < x* implies that X{ < x* for each i (all Xi are 
positive) we conclude 

T(xi) H h T(x d ) = T(x 1 H h x d ), x; > 0, xH \-x d < x*. 

By continuity of T this holds also for x« > 0, x\ + ■ ■ ■ + x d < x* . With x\ = ■ ■ ■ = x& = 
we get cZT(O) = T(0), so T(0) = 0. Moreover, setting x% = ■ ■ ■ = Xd = and using that 
T(0) = yields 

T(xi) + T(x 2 ) = T(x\ + x 2 ), xi,x 2 > 0, xi + x 2 < x* . 

By Lemma 12.151 below, the only solutions to this functional equation are T(x) = cx, 
x € [0, x*], where c G M. Hence ip(x) = Tp(cx) for x E [0,x*], and since both ip and V' are 
Archimedean generators it is immediate that c > 0. If x* = oo we are done. Otherwise 
we note that i/j(x*) = 0, and so ip(x) = t/j(x/c) = for x > cx*, since ip is non- increasing. 
□ 

Lemma 2.15 Let f be a real-valued, continuous function on R + such tha$ 

f(x) + f(y) = f(x + y), x,y>0,x + y<a, (2.4) 

where a G (0, oo]. Then /(x) = cx for all x G [0, a] and some c G R. 

Proof. We prove the lemma in the case where a < oo. If a = oo the proof is simpler 
in the sense that one does not have to be as careful to avoid going outside [0, a], where 
the functional equation is defined. Otherwise it is very similar. 

We may without loss of generality assume that a = 1. Once this case is established, the 
general case follows by considering /(x) := f(ax). Moreover, by continuity of / we have 
(|2.4|) for x, y > 0, x + y < a. Now assume that 

(i) f(nx) = n/(x) for all x such that < nx < 1 

holds for some n G N, and take x such that < (n + l)x < 1. Then 

/(nx + x) = /(nx) + /(x) = n/(x) + /(x), 

using (|2.4p and (i). The latter is trivially satisfied for n = 1, so by induction it holds for 
all n. Take now m G N and x G [0, 1]. Then G [0, 1], so (i) yields /(x) = /(^™) = 
m/(^), or, equivalently, 

(") /(^x) = ^/(x) for all x G [0,1]. 

2 Note the similarity to the well-known Cauchy functional equation: /(x) + /(y) = /(a; + y) for all 
a; £ R. See Bingham et al. [2], Section 1.1, for a treatment. However, although similar, our case requires 
a separate proof. 
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Finally take x G [0, 1] rational. Then x = ^ for some n G N U {0} and m G N with 
m>n. Since < r?A < 1 and ^ < 1, we may apply (i) and (ii) to obtain 

m = f(-)=nf ( -T) = ^/(l) = */(!). 

With c := /(l) we thus have / (x) = cx for all rational x G [0, 1], and by density and 
continuity it also holds for all x G [0, 1]: take x„->i£ [0, 1] with x n G [0, 1] rational, 
and deduce that f(x) = lim n _ ) . 0O f(x n ) = cx. □ 
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Chapter 3 



Regular variation and extreme 
value theory 



3.1 Regular variation 

We summarize here some basic results from the theory of real-valued regularly varying 
functions. The main reference for this theory is Bingham et al. [2j, and the results are 
stated (and occasionally proven) in many other places as well. Two good references 
are Embrechts et al. [10] and Resnick [25] . The latter reference also discusses so-called 
multivariate regular variation in the context of multivariate extreme value theory. 

We use the following notation: for two positive functions / and g we write f(x) ~ g(x), 
x — > oo, if lim^oo f{x)/g{x) = 1. Whenever f(x) ~ appears, it is to be understood as 
liniz^oo f(x) = 0. There will be no danger of confusing the two meanings of the symbol 



Definition 3.1 A positive, measurable function f on (0,oo) is called regularly varying 
with index p, written f 6 RVp ; if for any c> 0, 



It is called rapidly varying (with index —oo), f 6 RV_oo, if 



Functions in RVo are called slowly varying. 

Note that a regularly varying function / £ RV p always can be represented as f(x) = 
x p L(x), with L £ RVq; just set L(x) := f(x)x~ p . In this thesis, the function / will 
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usually be the tail of some distribution function F, i.e. / = 1 — F = F. We will then 
sometimes write X G RV P for a random variable X ~ F to denote that F G RVp. 

Theorem 3.2 (Karamata's theorem) Let L G RVq be locally bounded in [0, oo]. Then 

(i) If a < 0, £/ien t^ 1 L(t)dt ~ -x a L(x)/a, x ^ oo. 

lfa>0, then /* r -1 L(*)<ft ~ x* 7 /(x)/<r, x — > oo. 

(mj TTie function x \— > f^ t~ l L{t)dt is slowly varying, if finite. 

Proof. These are Theorem 1.5.10, 1.5.8 and 1.5.9a in Bingham et al. [2]. □ 

Theorem 3.3 (Monotone Density Theorem) Suppose U(x) ~ cx p L(x), x — > oo, where 
c G R, /0 G R, L G i?Vb and suppose u is some eventually monotone function. 

(i) IfU(x) = u(y)dy, then u(x) ~ —cpx p ~ 1 L(x) as x — > oo. 

(ii) IfU(x) = Jq u(y)dy, then u(x) ~ cpx p ~ l L{x) as x — > oo. 

Proof. (ii) is Theorem 1.7.2 in Bingham et al. [2], and the proof of (i) is virtually 
identical. □ 

Theorem 3.4 Suppose f G RV-oo is non-increasing and take a G R. Then f^ t a f(t)dt < 
oo for some xq > 0, and 

rt a f{t)dt 

lim J* 1 y ' = o. 

z->oo X a+1 f(x) 

Proof. This is (part of) Theorem A3. 12 in Embrechts et al. [10J. □ 



3.2 Extreme value theory 

Univariate extreme value theory in its most basic form is concerned with the study of 
limit relations of the form 

F n (c n x + d n ) — > G(x) weakly, n — > oo, (3-1) 

where F and G are distribution functions with G non-degenerate, c n > 0, and d n G R. 
The probabilistic interpretation is the following: Suppose X\, X2, ■ ■ ■ are i.i.d. random 
variables distributed according to F, and let M n := max(Xi, . . . , X n ) denote the largest 
of the first n variables in the sequence. Then, by independence, P[M n < x] = ¥ n [Xi < 
x] = F n (x), and after translation and scaling by d n and c n , P[c~ 1 (M n — d n ) < x] = 
F n (c n x + d n ). The relation (|3.ip thus expresses weak convergence of the normalized 
maxima c~ (M n — d n ). 
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If (|3,1|) holds for some specific F and non-degenerate G, we say that F lies in the maxi- 
mum domain of attraction (or just domain of attraction) of G, denoted F G MDA(G). 
The first issue to resolve is whether the limit G is unique. More precisely, is it possible 
to find normalizing sequences in (|3.ip so that for the same F we get convergence to two 
different limits G\ and G-p. It turns out that this can happen, but in that case G\ and 
G2 are of the same type, meaning that Gi(x) = G2(Ax + B) for some constants A > 
and fi £l. This is part of the Convergence to Types Theorem (see Proposition 0.2 in 
Resnick [25]). 

The next task is now to determine the possible limits G. This was done by Fisher 
and Tippett [13], and it turns out that the limit (up to type) belongs to one of three 
families: 

(i) The Frechet family: & a (x) = exp{— x~ a }, x > 0, for some a G (0,oo). 
(ii) The Gumbel distribution: A(x) = exp{— e~ x }, 

(Hi) The Weibull family: ^ a (x) = exp{— (— x) a }, x < 0, for some a G (0,oo). 

For each of these three families it now remains to characterize the respective domains of 
attraction and the corresponding normalizing sequences (c n ) and (d n ). 

Theorem 3.5 Let F be a distribution function. 

(i) (Frechet case) F G MDA($ a ) if and only if F G RV- a for a G (0, 00). 

(ii) (Gumbel case) F G MDA(A), if and only if there exists a positive function a, 
called the auxiliary function of F, with a(x)/x — > as x — > x* , where x* is the 
right endpoint of F, such that 

Fix + a(x)t) f , , TO , 
lim v — ~ = e~\ \ft G M. (3.2) 
x-fx* F(x) 

In this case one may choose a(x) := f x F(t)dt/ F(x). If F has a density, we may 
alternatively choose a(x) := —F(x)/F'(x). It is always possible to find an auxiliary 
function that is absolutely continuous with lima, _-.£.* a'(x) = 0. 

(Hi) (Weibull case) F G MDA( X I' Q ,) if and only if F has finite right endpoint x* and 
F(x* -x~ l ) G RV- a forae (0,oo). 

Proof. For (i) and (Hi), see Gnedenko [16] , Theorem 4 and Theorem 5, respectively. 
For the first part of (ii), see Gnedenko |16j . Theorem 7. For the two choices of auxiliary 
function, see de Haan |17j . Lemma 5 and Resnick [25], Proposition 1.1 (b). That the 
auxiliary function can be taken absolutely continuous with the given property can be 
deduced from Resnick [25], Proposition 1.4 and the proof of Proposition 1.1(a) in the 
same reference. □ 



We note that in the Gumbel case, all auxiliary functions of a given distribution F are 
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tail equivalent. This convenient fact is proved in [23], p. 2(£]. Moreover, if F S MDA(A) 
has unbounded support, then F is rapidly varying, F G RV_oo. 

In this thesis we will not need the full answer to the question of how to find appropriate 
normalizing sequences. For this we refer to Resnick |25| . Proposition 1.11 and 1.13 for 
the Frechet and Weibull domains, respectively, and Proposition 1.1, 1.4 and 1.19 for the 
Gumbel domain. For our purposes the following is sufficient: 

Proposition 3.6 // F 6 MDA(^ / Q ), the normalizations c n = x* — (1/F)*~(n) and 
d n = x* yield G = ^f a in \3.1\) . Here x* is the right endpoint of F. 

Proof. This follows from Proposition 1.13 in Resnick |25| . or can be deduced from the 
proof of Theorem 5 in Gnedenko's paper [16j . □ 



Alternatively one may invert both sides of f)3 . 2f) to obtain {-F* - (F(x)t) — x}/a(x) — > — log£ as x ^ x* . 
If this also holds with a(x) replaced by some a(x), then clearly a(x)/a(x) — > 1. That inverting (|3.2|l is 
allowed can be proven in the same way as Proposition 0.1 in Resnick [25| . 
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Chapter 4 



Tail properties of the Williamson 
d-transform 



We are interested in the extremal behavior of Archimedean copulas, which depends 
on the generator near its right endpoint and near the origin. We prefer, however, to 
work with £i-norm symmetric distributions instead of Archimedean generators, and it 
is thus of interest to connect the extremal behavior of the radial distributions to that of 
the corresponding generators. By Theorem 12.121 this amounts to investigating the tail 
properties of the Williamson d-transform. This program is carried out in the present 
chapter. We separate the analysis into four different cases: regular variation at infinity, 
the Gumbel domain of attraction, the behavior near the right endpoint in the case of 
bounded support, and the behavior at the origin. 

Due to the interpretation of the Williamson d-transform as the survival function of cer- 
tain Beta mixtures, the proofs of some of the results below can be significantly simplified 
using the following result by Breiman, slightly extended to fit our needs: 

Lemma 4.1 Let X and Y be two non-negative random variables such that F G RV_ Q; 
a G (0,oo) ; where F is the distribution function of X, andM\Y a+£ ] < oo for some e > 0. 



IfY& (0,1) a.s., the conclusion holds also with a = and a = oo, where in the latter 
case E[y a ] = by convention. 

Proof. The case when a G (0, oo) is covered by Breiman's original result [3], so we 
suppose that Y G (0, 1) a.s. Denoting by G the distribution function of Y we get, using 
Lemma 11.41 



Then 




x — > oo. 
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Since F is non-increasing and t G (0, 1), the integrand is bounded by one, so we may 
interchange limit and integration by the Dominated Convergence Theorem. If a = 0, 
F(x/t)/F(x) — > 1 as x — > oo. If a = oo, the limit is zero. In either case we clearly reach 
the desired conclusion. □ 

As an aid for the intuition, the reader is encouraged to think of the results in this chapter 
as probabilistic statements about mixture variables, and not only as analytical properties 
of integral transforms. 



4.1 Regular variation to the right, strict case 

A strict Archimedean generator ip, viewed as a survival function, can be expected to 
possess interesting extremal behavior only if ip G RV_ a for some a G (0, oo]; this follows 
from Theorem \3.5\ since any other behavior of ip would exclude it from being in some 
domain of attraction. By definition, ip G RV_ a means that 

l im t^x) = Vc > ^ (4J) 

X-KX3 1p(x) 

Now, by Theorem 12.121 ip has to be d-monotone in order to generate a copula in di- 
mension d. At first sight, this seems to be a fairly strong regularity condition on ip, 
especially for large d, and one wonders if perhaps (|4,ip always holds, at least if we also 
allow a = 0. This question is particularly relevant in view of the fact that all commonly 
encountered families of Archimedean copulas have generators that do satisfy ()4.ip . We 
now give a counterexample to show that this is not always the case. 

Example 4.2 Define 

= l + aBin(log(l+x)) x > 
1 + x 

for some a > to be determined. Then ip(0) = 1 and \im x ^. 00 ^p(x) = 0, and if a < 1, 
then ip(x) > for all x > 0. We show that for any d, one can choose a > such that ip 
is d-monotone. Suppose that for some k G {0, 1, . . . }, 

= (-!)* + + x>0, (4.2) 

where Ck and (fk are constants. Differentiating and using standard trigonometric identi- 
ties one obtains 

^+i)t x \ = i (fe + 1)! + a{k + l)c fc sin(y fc + log(l + x)) - ac k cos(y fc + log(l + x)) 

(l + x) k+2 



(-1) 



^k+i (k + 1)! + QCfc+i sin(y? fc+ i + log(l + x)) 



(1 + x 
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where Ck+i = CkVk 2 + 2k + 2 and v?fc+i = + arctan(l/(/c + 1)). Since J^-£| ) holds for 
k = mt/i Co = 1 and </?o = 0, it follows by induction that it holds for all k G {0, 1, . . . }. 
Since sm(y) > — 1 /or a// y G R, it is clear that given d > 2, {-l) k ^ k \x) > for 
all k = 1, . . . , d — 1, provided < a < k\/ck for each k. Since for k > 2 it holds that 
Cfc > kl and max(co, . . . , c^) = c^, such an a can indeed be found and satisfies a < 1 
(take a = (d — l)\/cd-\), implying that tp is indeed d-monotone. 

We now consider the behavior of ip(cx) /ip(x) as x —> oo along the sequence x n = e nn . 
We write 

^{cx n ) _ 1 + asin(log(l + cx n )) 1 + x n 1 + x n 

ip(x n ) 1 + asin(log(l + x n )) 1 + cx n 1 + cx n 

Since (1 + x n )/(l + cx n ) — > 1/c, we focus attention on A(x n ). With x n as above we get 

l + asin(nvr + log(e- n7r + c)) _ 1 + a(-l) n sin(log(e- n7r + c)) 
~ 1 + a sin(n7r + log(e~ n7r + 1)) ~ 1 + a(-l) n sin(log(e- n7r + 1)) ' 

The denominator converges to one as n — >• oo, 6u£ i/ie numerator does not converge in 
general: along the subsequence where alln are even, it converges to l + asin(logc), while 
if all n are odd, the limit is 1 — a sin(log c) . So unless log c = rmr for some integer m, 
ip(cx n )/?p(x n ) does not converge. 

We remark that there are known families of Archimedean copulas where the generators 
satisfy ()4. 1[) with a = 0. One such example is ip(x) = 6/\og{e e + x) with 9 G (0,oo), 
which generates bivariate Archimedean copulas (see Nelsen |22| . item 19 of Table 4.1, 
where however the inverse generator cj){t) = e Q l l — e 9 is given instead of if)). In such 
cases it is still possible to draw conclusions about the tail behavior of the copula, see 
Charpentiers and Segers [6], although the corresponding ^i-norm symmetric distributions 
do not possess non-degenerate extremal behavior (under affine normalizations). 

After this introductory discussion, we turn to sufficient and in most cases necessary 
conditions on the radial part of the corresponding ^i-norm symmetric distribution for 
gH) to hold. 

Proposition 4.3 Let F be a distribution function with -F(O) = and let tp = WdF, 
d>2, be its Williamson d-transform. Then 

(i) If F G RV- a , a G [0,oo) ; then if) G RV_ Q . 

(ii) Ifipe RV- a , a G (0,oo) ; then F G RV_ Q . 

Proof. Suppose R ~ F and let S\ be Beta(l,d — 1) distributed and independent of 
R. Then, as we have seen (e.g. in Theorem I2.12p . ip{x) = ¥[RS\ > x\. Observing that 
Si G (0,1) a.s., we obtain from Lemma 14.11 that F(x) ~ B\ip(x) as x — > oo for some 
B\ > 0, whence the statement (i). 

For part (ii), we have by assumption that ip(x) = x~ a L(x) for some a > and slowly 
varying L. We first show by induction that, for k = 0, 1, . . . , d — 1, 

(-l) k x k il)^(x) ~ c k x~ a L(x) = c k tp(x), x -)■ oo, 
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where cq = 1 and c& = n^LoC? + a ) f° r k = 1, . . . ,d — I. For k = the statement is 
trivially true. For k = 0, . . . , d — 2, 

/•oo 

^(*)( x ) = - / v (fc+1) (y)%, 



since as x — )• oo. Moreover, ^/A fc+1 ) is monotone since if) is (i-monotone, 

and ^ k \x) ~ (— l) fc CjfcX _fe_Q L(x) by the induction assumption. The Monotone Density 
Theorem now implies that ^ k+1 \x) ~ (— l) k Ck(— k — a)x~ k+1 ~ a L(x) , which is precisely 
the statement we wanted to prove. 

Inserting this into the inverse Williamson (i-transform yields 

F(x) _ l-2rr^V(x) 1 {-l) k x k ^ k \x) ^1 

k — k — 

The limit is clearly positive since a > 0. □ 
Remark. One can show that 

d-l d-l fc-1 d-l 

Es*=i+Esn&-+<«)=n(i+i). 

k=0 k=l j=0 k=l 

which by Breiman's result is equal to 1/E[5f ]. As a by-product we thus get an expression 
for the raw moments of Beta(l, d — 1) distributed random variables. 



4.2 The Gumbel domain of attraction 

If F € RV_oo, which for instance is the case if F € MDA(A) and has unbounded support, 
we may use Lemma 14.11 similarly as in the proof of Proposition 14.31 (i) to conclude that 
ip(x)/F(x) — > as x — > oo. This gives no information about the precise behavior of the 
tail of ip. In what follows we derive more precise results in the case where F and/or ip is 
in the Gumbel domain of attraction. This is the relevant case in the context of extreme 
value theory. It turns out to be unnecessary to distinguish between the strict and the 
non-strict cases. 

We proceed by proving a number of intermediate results. The condition (|3.2|) from 
Theorem 13.51 will appear several times, and we emphasize that the requirements on the 
auxiliary function a are part of this condition. Namely, a(x) > and a(x)/x — > as x 
tends to the right endpoint x* (which may be infinity). 

Lemma 4.4 Suppose the non-negative, non-increasing function f satisfies 113. ty) . i.e. 

lim f±+?m = e -*, vt e R, 

x^x* f(x) 
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where a(x) > and a(x)/x — > as x f x* . If f has a density f that is monotone in a 
neighborhood of x* , then 

f'(x + a(x)t) f w m 
lim I v . v 1 1 = e~\ Vi e R, 

Proof. The core of the argument is similar to the proof of Theorem 3.10.11 in Bingham 
et al. For all x large enough, x + a(x)t = x(l +ta{x)/x) is close to x* for any t E R, 
since a{x)/x — > as x f x*. Now choose any u,« £ R, ti > d. For all x sufficiently 
close to x* , say x S (xo, x*), we thus have that x + a(x)v and x + a(x)n are both in the 
region where /' is monotone. Since / is non-increasing, — /' is non-negative. We look at 
the case where — /' is non- increasing near x* , the non-decreasing case being completely 
analogous. We obtain 



f(x + ei(x)u) — f(x + a(x)v) a(x) 

f(x) f(x) J v 



-f'{x + a(x)t)dt 



< -^r\f'(x + a(x)v)(u-v), 
fix) 

where each expression is non-negative. The left side tends to e~ v — e~ u as x — > x* , and 
hence 

Qj(x) e~ v — e~~ u 
liminf — 7rrf'(x + a(x)v) > lim = e~ v , 

x^x* /(x) ulv U — V 

where the last equality follows by e.g. l'Hopital's rule. Similarly one shows 

Qj{ X ) 

limsup --^r-rfix + a(x)u) < e~ u , 

x^x* f{X) 

and so 

lim -^\f'(x + a(x)t) = e~ t , Vf € R. (4.3) 

x-)x* f{X) 

In particular, we may take t = to obtain —f{x)/a{x) ~ f'(x), x — > x* , which implies 
that /' is non-zero near x*. Substituting back into (|4.3p gives the conclusion. □ 

Lemma 4.5 Suppose the function f satisfies A3.2\) . Then, for any p E R, g(x) = x p f(x) 
also satisfies with the same auxiliary function. 

Proof. A simple computation yields 

g(x + a(x)t) _ f 1+ a ( x \Y f(x + a(x)t) _^ 



g(x) V x J f{x) 

since / satisfies (|3.2p . which in particular means that a(x)/x — )• as x — >• x*. □ 

Lemma 4.6 Suppose the non-negative, non-increasing function f satisfies i3.2\) . Then 
g(x) = f[t)dt also satisfies 113. S\) with the same auxiliary function. 
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Proof. Suppose a is the auxiliary function of /. By Theorem 13,51 we can find another 
auxiliary function a that is absolutely continuous with a'{x) — >• 0, x — > x*. L'Hopital's 
rule then gives 

g(x + a(x)t) — f(x + a(x)t). .. , , 

lim ^ . \ 1 1 = lim v x (1 + a'(x)t). 

The limit on the right is clearly e~* by the properties of / and a. So g satisfies (|3.2p with 
auxiliary function a, hence also with a, since the two are tail equivalent, i.e. a(x)/a{x) — > 
1, x -> x*. □ 



Lemma 4.7 Let F be a distribution function with F(0) = and Zet i/> = SU^i 7 . T/ien 
^(d-i) j S differentiable if and only if F is differentiable, and in this case 

F'( x ) = (~ 1 ) <f ~ 1 ^-i,/,W< 



(rf-l)I 



Proof. From the inversion formula of the Williamson (i-transform in Proposition I2.5[ 
it is clear that ■0( ci ~ 1 ) is differentiable if and only if F is, and that the derivative F' in 
this case is given by F'(x) = fd(x), where we define 



fnix) := D 



n-l 



.k=0 



for all n such that i/j( n ) exists. Here D[-] denotes differentiation. We are done if we can 
show that for any such n, 



(-l) n_1 
= (n-l)l xT 



"V (n) (x). 



(4.4) 



We argue by induction over n. For n = 2, f n (x) = D[ip(x) — xip'{x)\ = —xip"(x), so 
(|4.4p holds. Suppose (|4.4p holds for n — 1 and exists. Then 



/„(*) = / n _i(x) + L> 



1) 



n-l 



(n-l)! 



-V (n_1) 0> 



\n— 1 



(n-l) 



-x n -V (n) (^)- 



□ 



We are now ready to prove the following proposition concerning the connection between 
F and ip when either is in the Gumbel domain of attraction. Note the assumption that 
F or ip be differentiable — we are not treating the most general case here. 
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Proposition 4.8 Let F be a distribution function with F(0) = 0, let t/j = WdF for d > 2 
be its Williamson d-transform, and suppose that either F or tp( d ~ 1S) is differentiate. 
Both are then differentiate, and if we in addition assume that F' is monotone in a 
neighborhood of its right endpoint x* , then F £ MDA(A) if and only if t/j £ MDA(A). 
In this case, 

a{x) = a{x) = 

are possible choices of auxiliary function for both F and if). 

Proof. Lemma 14.71 implies that both F and if) are differentiable and satisfy 

F'(x) = | x d -V (d) (g)- (4.5) 

We start with necessity, so suppose F is in the Gumbel domain. Theorem 13.51 and the 
monotonicity assumption on F' = —F' then enable us to use Lemma f4.4l to conclude that 
—F', and hence (— l) d x d ~ 1 ^'^ , satisfies (|3.2p . The auxiliary function carries over, and 
Lemma 14.51 implies that (— l) d if)^ also satisfies (|3.2p with the same auxiliary function. 
Since 

poo 

{-l) k ^ (k \x) = / {-l) k+l ^ k+l \t)dt 



for all k = 0, 1, . . . , d — 1, repeated use of Lemma I4TB1 yields that ip satisfies (|3.2p . again 
with the same auxiliary function. Hence ip £ MDA(A). 

To prove sufficiency the procedure is simply reversed. In this direction the use of Lemma 
14.41 is justified since if>^ are monotone by <i-monotonicity for k = I, ... ,d — 2, and also 
for k = d — 1 since is convex. For k = d, we use (|4.5p together with the eventual 

monotonicity of F', which implies that tj}^ also is eventually monotone — indeed, suppose 
f(x) = x p g(x) for some functions / and g on (0, x*), where p > 0. If for any xq < x* one 
can find x,y £ (xq,x*), x < y, such that g(x) < g(y), then / cannot be non-increasing 
on any left neighborhood of x* , since clearly f{x) < f{y). With / = —F' and g = -^ d \ 
this gives a contradiction if ip^ is not eventually monotone. □ 



4.3 Regular variation to the right, non-strict case 

We start with a result that describes how non-strict d-monotone Archimedean gener- 
ators and their derivatives behave in case they are regularly varying near the right 
endpoint. 

Proposition 4.9 Let if) = 2Bd-F for some distribution function F with F(0) = 0, and 
suppose the right endpoint x* = ^ _1 (0) is finite. 
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(i) Iftp(x* - x -1 ) G RV_ Q , a > 0, then 

(-lf^)(x*-x- l )eRV- a+k 

for k = 1, . . . , d — 1. Moreover, in this case a > d — 1. 

(ii) If (-l^V^O* - x- 1 ) G RV- a , a > 0, i/ien 

(-i)V (fc) (^-OGRV_ a _ (d _ 1)+fc 

/or A; = 0,...,d-2. 

Proof. We first prove (i). Since is continuous for fc = 0, . . . , d — 2, tp^ k \x*) = 
for these fc. Hence for fc = 1, . . . , d — 1, 

roo 

_ x -i) = _ / r 2 ^ k \x* - r x )dt. (4.6) 

J X 

Note that is in general only a density, not a derivative, of ip( d 2 -* (see the comment 

after Definition I2.3j) . This causes no extra difficulties, since ifj^ d ^(x) = — J x g(s)ds 
implies (|4.6p for k = d — 1 upon changing variables to t := (x* — s) . We now proceed 
by induction over k. The induction assumption is that, as x — > oo, 

_ £-1) ^ Cfc _ lX - Q + fc - 1 L(x), -a + fc - 1< and c fc _i ^ 0. 

Here L is some slowly varying function. The Monotone Density Theorem and (|4.6|) then 
imply that — x~ 2 tp( k \x* — x _1 ) ~ (a — k + l)ck-ix~~ a+k ~ 2 L(x) , or, equivalently, 

^ k \x* - x" 1 ) ~ c k x- a+k L(x), (4.7) 

where = (—a + A; — l)cfc_i 7^ 0. 

To complete the induction step, we must show that —a + k < 0. So suppose first that 
—a + k > 0. Then (|4.7p implies that (— l) fc '^( fc )(x) — > +00 as x — > x* , which contradicts 
ci-monotonicity. Hence — a + k < for k = 1, . . . ,d — 1, which in particular gives the 
last statement in part (i). 

We still have to exclude — a + k = 0. However, it is enough to show this for k = 
1, . . . , d — 2, since this still would allow us to reach (|4.7j) for & = d— 1 and c^-i 7^ 0, but 
possibly with —a + cZ — 1 = 0. So suppose —a + k = for any fc G {1, . . . , d — 2}. Then 
(— l) k ip( k \x* — x _1 ) G -RVo, but at the same time d-monotonicity implies that (— l) k ip^ 
is convex for k = 1, . . . , d — 2, and thus in particular 

{-i) k 4 k) U - \x-^ = (-i) fc v (fc) Q(x* - x- 1 ) + ~ X A 

< (_l)k^(*)( x * _ x -l) + (_l)*^(fc) (a .*) 
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since tp^(x*) = 0. Equivalently, 

2 < 



ip( k )(x* - (2a:)- 



But here the right side tends to one as x — > oo since (_l)*^(*0(a;* - x- 1 ) £ RV , so the 
inequality is eventually violated. This gives the desired contradiction. 

The statement that was to be shown now follows upon noting that the induction as- 
sumption is satisfied for k = 1 by hypothesis. 

The proof of (ii) is simpler. This time the induction assumption is 

(-l)V fc V - x- 1 ) G RV- a - id - 1)+k , 

where k < d — 1. It clearly holds for k = d — 1 by hypothesis. To prove the induction 
step, we use (|4.6|) together with the induction assumption to obtain 



oo roo 



(-l)*-V ( *~ 1) (a;*-x~ 1 )= / t- 2 (-l)*^*)(x*-t- 1 )dt= / t- l ~ a - d+k L{t)dt 

J X J X 

for some slowly varying L. Since a + d — k > 0, Karamata's Theorem gives 

(-i)*-y*-i)( x * _ x- 1 ) L-^-a-^D+ck-i)^), 

a + d — k 

and so (-l^-V^V " O G RV_ a _ ((i _i) +{ifc _i). □ 

Note that for a G (0, oo), Proposition 14.91 says in particular that il){x* — x~ l ) is in 
RV_ Q _d + i if and only if — x" 1 ) is in RV_ Q . We use this fact to prove the 

following connection between the generator ip and the radial part R ~ F. As in the 
case of rapid variation, Section 14.21 we restrict ourselves to differentiable distribution 
functions. 

Proposition 4.10 Let F be a distribution function with F(0) = and right endpoint 
x* < oo, let ip = WdF for d > 2 be its Williamson d-transform, and suppose that either 
F or ipv^—i) is differentiable. Then, for a > 0, 

F{x* - x~ l ) G RV_ a if and only if ip(x* - x~ l ) G RV_ a+d _i. 

Proof. We prove the "only if" part. By Lemma |4.7| both F and ifj( d ~^ are differen- 
tiable, and hence 

poo 

F(x* - X' 1 ) = - / t- 2 F'(x* - r^dt, 



since F(x*) = 0. By the Monotone Density Theorem, —F'(x* — x 1 ) G RV_ a +i. Lemma 
14.71 says that 

-F'(x* - x~ l ) = - 1 (s* - aT 1 )*-^-!) V°(a>* - x' 1 ), 
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which implies that also (— l) d ip( d ' (x* — x 1 ) is in RV_ Q +i, since (x* — x 1 ) d 1 tends to 
a constant and thus is slowly varying. Now, 

poo 

(_i)<*-i^-i) (x * _ x -i) = / t -2(_i)«*^(«o ( x * _ r 1 )*, 

where the right side is equal to t~ a ~ 1 L(t)dt for some slowly varying L, due the regular 
variation of (—l) d if)^(x*—x~ 1 ). Karamata's Theorem then implies that (—l) d ~ 1 if)^ d ~ 1 \x* — 
x^ 1 ) G RV_ a , and we may apply Proposition 14.91 to reach the desired conclusion. 

To prove the converse, one argues analogously, but in the reverse order. First one estab- 
lishes regular variation of if>^ d ~ l \x* — x~ l ) using Proposition ^. 91 By the differentiability 
assumption, one can differentiate once more and use the Monotone Density Theorem to- 
gether with Lemma f4.7l to obtain regular variation of —F'(x*—x~ l ). Karamata's theorem 
then gives the conclusion about F. □ 

The intuition behind this result is that when F (and hence if)) has a finite right endpoint 
x*, then the asymptotic behavior of F near x* is the same as the asymptotic behavior of 
near x* . This is different from the case when x* is infinite — then F and if) have 
the same asymptotics, according to Proposition 14.31 



4.4 Behavior at the origin 

We conclude the investigation of the asymptotics of the Williamson d-transform by 
considering the properties near the origin. 

Proposition 4.11 Let R be a random variable on (0, oo) with Williamson d-transform 
ip. Then 

(i) If l/R e RV- a , a G (0, 1), then 1 - ^(x' 1 ) G RV_ a . 

(ii) Ifl/R G RV_ a , a G [l,oo], then 1 - ip{x~ l ) G RV_i. 
(Hi) If 1 - if){x- 1 ) G RV_ Q , a G (0, 1), then l/R G RV_ a . 

Proof. Let S\ be a Beta(l, d— 1) distributed random variable independent of R. Since 
Si is continuous we have 

P[l/Si >x]= P[Si < 1/z] = 1 - (1 - x' 1 )^ 1 ~ x' 1 , x -»• oo, 

which implies that 1/S\ G RV_i. Furthermore, it follows from the discussion after 
Proposition 12.51 that 1 — if>(x~ 1 ) is the survival function of l/(RSi). 

To show (i), we pick an e > such that a + e < 1, which is possible since a < 1. 
Then because l/5i G RV_i, E[(l/S'i) Q + e ] < oo. Now, l/R G RV_ Q by assumption, 
so Breiman's result, Lemma 14. 1| implies that 1/(RS\) G RV_ Q , or, in other words, 

1 - VKO G RV_ Q . 
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As for (ii), consider first the case when a G (l,oo]. Then because 1/R G RV_ a , 
E[(l/i?) 1+e ] < oo whenever e is such that 1 + s < a. One may then use Breiman's 
result again to conclude that 1/(RS\) G RV_i, i.e. 1 — Y>(x _1 ) € RV_i. If a = 1, 
Breiman's result is not applicable. Instead we show explicitly that in this case, 

T I \ 1 _ ^(lAO Irs \ 

L(x) := ;) 1 \ x G 0, oo), 

1/x 

is slowly varying. To do so, first use the definition of the Williamson ci-transform and 
the binomial expansion theorem to write 

L ( x ) = -Il( d ~ k 1 ) (-l) fe ^ fc+1 I£[irt {R>1M ], x G (0, oo). 
k=i ^ ' 

Further, denote by G the survival function of 1/R and integrate by parts to obtain 

E[R- k l {R>1/x} ] = - f X y k dG(y) = -x k G(x) + k f G{y)y k - 1 dy. 
Jo Jo 

Thus L(x) may be expressed as L(x) = A(x){Bi(x) + • • • + Bd-i(x)} where 
A(x) = £ G(y)dy, B x (x) = (d - 1) |l - ^} 

and 

Karamata's Theorem now implies that 

lim ^7 \ =0 ' lim — krit \ = 1 7' k = 2,...,(d-l). 

x-^oo A[X) x^oo x K G(x) k — 1 

Thus as x — > oo, B\(x) — > d — 1 while B)~[[x) — > for fc = 2, . . . , (d — 1). Since A(x) is 
slowly varying by Karamata's Theorem, L(x) is a product of two slowly varying functions 
and therefore itself slowly varying. 

Part (in) is based on the Monotone Density Theorem. Suppose 1 — il)(x~ l ) = x~ a L(x), 
with L slowly varying. Since i/}(0) = 1 we get 

fl/x roo 

1 - = - / ^(y)dy = - / y' 2 ^(y- l )dy. 

JO Jx 

The Monotone Density Theorem gives —x~ 1 tp'(x~ 1 ) ~ ax~ a L(x) as x — > oo. Suppose 
now that (— l) fc x~ fc ^( fc )(x~ 1 ) ~ CkX~ a L(x) for some 1 < A; < d— 2, where c& is a constant. 
Since ip( k \x) = — ip ( - k+1 \y)dy we get, after changing variables, 

(-1) W 1 ) = ! X (-l) k+l ^ k+1) (y- l )y- 2 d y . 
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By the Monotone Density Theorem, {—l) k+1 ip^ k+1 > {x l )x 2 ~ Cfc(— a + k)x a+k 1 L(x), 
and so by induction we obtain, for k = 0, 1, . . . , d — 1, 

{-ifx^^Xx- 1 ) ~c k x a L{x), x^oo, 

where the c& satisfy cq = 1, c\ = a and Ck+i = (k — a)c/% > 0. The inversion formula for 
the Williamson (i-transform now yields 

n.- 1 ) = ■-»-vE ( -"T""' 

fc=i 

~ Tyc fc x a L(x) = £(1 - ^(x -1 )), x^oo, 

fc=0 

where the constant B is positive since each is. □ 



Suppose that ip is a <i-monotone Archimedean generator. In particular, this implies that 
?/>(x) := ip{— x) is a distribution function on (— oo,0]. Our final result shows that d- 
monotonicity of ip restricts the possible extremal behavior of ip. In particular, it shows 
that Proposition ^. lll indeed covers (almost) all relevant cases (the exception being a = 1 
in part {Hi), which is not covered). 

Proposition 4.12 Letip be ad-monotone Archimedean generator andip as above. Then 

(i) If 1 - ^(x- 1 ) G RV_ Q , then a G [0, 1]. 

(ii) If the extremal behavior of ip is non- degenerate, then ip is in the maximum domain 
of attraction of the Weibull distribution with the parameter range restricted to 
(0,1]. 

Proof. Part (i) stems from the fact that if a > 1, 

llm 1-^/2)^-^1 
xio 1 — ip{x) 2 

In particular, therefore, there exists an xo 6 (0, oo) such that ip{x/2) > {ip{0) + ip{x)}/2 
whenever x G (0, xq). But tp is <i-monotone and hence convex, so we obtain a contradic- 
tion. 

Turning to {ii), observe that since the upper endpoint of ip is finite, ip cannot be in the 
Frechet domain of attraction. Furthermore, a necessary condition for ip G MDA(A) is 
that 1 — -^(x -1 ) G RV-oo, which is excluded by {i). So ip is in the Weibull domain. This 
implies in particular that 1 — ip{x~ 1 ) G RV_ a for some a G (0, oo). Due to {i) however, 
a > 1 is not possible. □ 
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4.5 Conditions on the inverse generator 



The most common approach found in the literature for dealing with Archimedean copulas 
is to express them in terms of the inverse generator <fi = tp . We have already seen 
some of the virtues of working with the generator ip instead, and the advantages will 
become even more apparent later on. However, in order to remain compatible with 
existing conventions we here give equivalent conditions on <p for the results in Sections 

Proposition 4.13 Let ip be an Archimedean generator, and let (p = ip' 1 be the inverse 
generator. 

(i) Tp G RV_ a , a G (0,oo) if and only if (p{l/s) G RVi/q,, or, equivalently, 

l im ^££) =c -iA* for all c>0. 
40 (p{s) 



(ii) ip(x* — x x ) G RV_ a; a G [d — 1, 00), if and only if (x* — <p(l/s)) 1 G RV ly / a , or, 
equivalently, 

l im x * ~ ^s) = c i/ a for aU c > Q _ 

siO X* — (j)(s) 

Here x* is the right endpoint of ip. 
(Hi) l — ip(x~ 1 ) G RV_ a , a G (0,1], if and only if cp(l — s^ 1 ) G RV_ 1 / a , or, equivalently, 

lim ^ ~ CS } = c l ' a for all c>0. 

slO 0(1 — Sj 

(iv) ip G MDA(A) if and only if <p{\/s) is in de Haan's H-clas^. The auxiliary function 
a(x) of ip{x) and c(s) of (p(l/s) are related through c(s) = a{(p{l/s)) and a{x) = 
<l/ip{x)). 

To simplify matters, we need the following adaptation of a theorem on inverses of regu- 
larly varying functions: 



Lemma 4.14 Suppose f is invertible and a G (0, 00). Then f G RV_ Q , a > if and 

l/c 



only if f 1 (s 1 ) G EVi/ a , or, equivalently, 



lim ^ \^ S ) = c" 1/Q , for all c> 0. 
40 f~ l (s) 



X A non-negative, non-decreasing function g is in de Haan's II-class if lim s _ >00 {gi(ts) — b(s)}/c(s) — log t, 
t £ K, for some functions b(s) £ R and c(s) > 0. The function c(s) is called the auxiliary function of g. 
See Resnick 25], Section 0.4.3 for an concise account of the relevant theory. 
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Proof. f G RV_ Q if and only if 1/f G RV a . An easy computation shows that 
[l//] _1 (s) = which by Theorem 1.5.12 in Bingham et al. [2] is equivalent to 

G RVi/q,. The last equivalent formulation follows directly from the definition 
of regular variation upon changing the variable from s to s . □ 



Proof. (of Proposition \4-13 ) Part (i) follows directly from Lemma 14.141 since ip is 
strict and hence invertible. For Part (m), note that x* — x" 1 G > 0}. On this set 
ip is invertible, and the inverse of if){x* — is l/(x* — 0(s -1 )). Now apply Lemma 
14.141 Part (Hi) follows in the same fashion since ?/>(x -1 ) is invertible for x sufficiently 
large, with inverse (j)(l — For Part (iv), Proposition 0.10 in Resnick {25] says that 

ip G MDA(A) if and only if (l/ip)^(s) = inf{x > | l/ip(x) > s} is in de Haan's Li-class. 
But since ip is continuous, we have inf{x > | l/ip(x) > s} = inf{x > | l/ip(x) = s}, 
which is equal to </>(s -1 ) by definition. The relationship between the auxiliary functions 
follows from Proposition 0.9 in [25], where, in order to obtain the expression for a(x), we 
used that [c/)(l / s)]*" = since <j) is a right inverse of ip: (j)(l/s) = x implies 1/s = i(j(x). 
□ 
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Chapter 5 

Coefficients of tail dependence 



5.1 Coefficient of the first kind 



An important problem in probability theory and statistics is to find simple measures, 
preferably scalar, that quantify the strength of dependence between random variables. 
In the context of extremes, one may ask about the likelihood of observing large values 
in one component X\ given that another component X 2 is large. One such measure is 
the coefficient of (upper) tail dependence X u , defined as 

X u := limPpfi > Fj-(q) | X 2 > Ff(q)}, 

where F\ and F2 are the marginal distribution functions of X\ and X2, respectively, and 
the limit is assumed to exist (this is not always the case!). The coefficient of lower tail 
dependence, A;, is defined analogously: 

\i : = limPLYi < F«-(q) \ X 2 < Ff(q)}. 

Note that dependence is measured on a quantile scale. It is therefore not surprising that 
these measures can be expressed in terms of the associated copula C. Indeed, using the 
definition of the generalized inverse, one easily checks the validity of 

P[Xi < F^q) I X 2 < Ffiq)] = < q, F 2 (X 2 ) < q] = 

[ 1 _ 1 - 2 WJ P[F 2 {X 2 ) < q] q 

Similarly one can show that A M also depends only on the copula. In the case where C is 
Archimedean, X u and A; have simple forms: 

Proposition 5.1 Provided the limits exist, the coefficients of upper and lower tail de- 
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pendence of an Archimedean copula with generator tfj are given by 

, 1 - ib(2t) 
X u = 2-lim 7 J 

\ lim^oo if V is strict 

1 0, otherwise 

Proof. Let (Ui,...,Ud) be distributed according to C. Due to exchangeability, all 
two-dimensional marginals are given by C(ui,U2, 1, . . . , 1) = il)(ijj~ l (ui) + il)~ 1 (u2))- The 
expression for \ u and the fact that the marginals are uniform yield 

a„ = luIl nu 1 > q ,u 2 > q] 



F[U 2 > q] 

lim 1 ~ F ^ Ul = ^ ~ P ^ 2 ~ g] + F[[/l ~ 9 ' U2 ~ g] 
9™ 1 - P[C/ 2 < q] 

= 2-lim -V^- 

t^o 1 - <^(t) 

Here we used that q = ip o and that i := ip~ 1 (q) — > as q — )■ 1. Similarly, 

= Bm lg^«.^«l = B m w 1 h)) = lim *30 

If -0 is strict, then ip~ (0) = oo and we get the result. If is non-strict, then c := 
i/> (0) < oo. So for all t G [c/2,c), the numerator ip(2t) is zero, while the denominator 
is non-zero. Hence the limit must be zero. □ 



Note that the generator in Example 14.21 provides an example where the index of lower 
tail dependence does not exist. However, the results from Section 0] allow us to give 
sufficient conditions for the existence of the coefficients of tail dependence and compute 
their values in terms of the corresponding radial part R. 

Corollary 5.2 Suppose the Archimedean copula C has generator ip, and let R ~ F = 
SH^ip be the corresponding radial distribution. 

(i) If 1/R G RV_ a , a G (0,oo), then X u = 2 - 2? , where /3 = a A 1. 

(ii) If Re RV_ Q; a G [0,oo), then \ = 2" a . 

Proof. For (i), we observe that 1 — ip(x~ 1 ) G RV_^ according to Proposition 14.111 and 
hence 

Hm 1 -*??- lim l-'Wf)-')^ 

t^0 1 — 1p(t) x^oo l — ty{X~ L ) 

by definition of regular variation. The expression in Proposition 15.11 for X u gives the 
result. For (ii), the result is immediate from Proposition 14. 3\ the definition of regular 
variation, and Proposition 15.11 □ 
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We remark that this result does not give conditions on R ensuring that X u = 1. This is 
due to the limitation a > in Proposition 14.111 (i). 

If ip G RV_oo, then Proposition 15,11 clearly implies that A; = 0. Proposition 14.81 gives 
one sufficient condition for this to happen, namely that the corresponding radial part R 
is in the Gumbel domain of attraction and has differentiable distribution function with 
eventually monotone derivative. 

Note that the converse of Corollary 15.21 does not hold — knowing e.g. A; only gives infor- 
mation about the limit of ip{cx) /ip{x) for c = 2, which is not enough to deduce regular 
variation of ip (and thus of R). The situation is analogous for X u . 

A natural extension of the coefficients of tail dependence for high-dimensional distribu- 
tions is 

Ap n := HmPpT! < Ff(q), ...,X m < F^(q) \ X m+1 < F^ +1 (q), . . .,X m+n < F^ +n (q)], 

q— i>0 

and analogously for A™' n . As in Proposition 15 . 1 1 one shows that for Archimedean copulas, 
we have the expression 

x m,n r ip({m + n)t) 
A, = hm — — 

t^oo ip(nt) 

if ip is strict. If ip is regularly varying with index —a, a G [0, oo) (which happens if R 
has the same property), we conclude as before that 



The expression for X u is possible to derive, but has a more complicated form since 
its derivation requires that survival functions of the copula C be expressed in terms 
of marginal distribution functions of different dimension. This can be done using the 
so-called sieve formula, but the calculations are cumbersome. 

As a final observation we note that although there is no hope of a finding a converse to 
Corollary 15.21 the situation is in a sense different for A™' n . For, if we know that A™' n has 
the form (|5.1|) for all m,n G N, then ip must be regularly varying with index —a. This 
follows from Theorem 1.4.3 in Bingham et al. [2J, which says that ip G RV_ Q if and only 
if ip{cx)/ip{x) — > c~ a for all c in a dense subset of (0, oo), provided the condition 

limsup^*(c) < 1, where ip*(c) := limsup ip(cx)/ip(x), 
cJ4 z-s-oo 

is satisfied. But this automatically holds since ip is decreasing and hence ip*(c) < 1 for 
all c > 1. Furthermore, although f)5. 1 j) only gives ip(cx)/ip(x) —> c~ a for all rational c in 
(l,oo), this is enough since for < c < 1 rational, it implies 

lim ip(cx)/ip(x) = lim ip(x)/ip(x/c) = \ lim ip(x/c)/ip(x)\ = c~ a . 

x— >ao x— ¥oo Vx— ¥oo J 
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Of course, this way of inferring properties of tp, and hence R, based on the behavior of 
A, ' is of little practical importance since all these coefficients must satisfy (|5.ip . The 
corresponding estimation problem is obviously infeasible. Moreover, the interpretation of 
A™' n as describing multivariate marginal tail behavior only remains valid if ifi generates 
a copula in any dimension, and thus is completely monotone. 



5.2 Coefficient of the second kind 

When a coefficient of tail dependence is zero, this is interpreted as asymptotic indepen- 
dence of the tails of the marginals, in the sense that unlikely events tend to occur more 
or less independently of each other. The "more or less" part is of course not completely 
satisfactory, and supplementary measures of tail dependence have been suggested. One 
such measure was introduced by Coles et al. [7J, and following Demarta [Hj we call it the 
(upper) coefficient of tail dependence of the second kind. It is defined as 

t _ lim log (nxi > Fr(g)nx2 > gtM) 1 

u <^o log (¥[Xi > Ff (q) , X 2 > F+- (q)} ) 

with the lower coefficient A; being defined analogously. These quantities compare, on a 
logarithmic scale, the joint exceedance probabilities under the actual dependence struc- 
ture with the exceedance probabilities one would observe if the components were inde- 
pendent. Expressed in terms of the copula C of (Xi, X2), the tail dependence coefficients 
of the second kind are 

X u = lim - — ; — -J7 — — 1 and A; = lim — f - , — 1. (5.2) 

q ->llog(l-2q + C(q,q)) q ^ologC(q,q) V ' 

We first consider the lower coefficient A;. It turns out that when ip G RV_ Q for a > 0, 
and hence A/ > 0, A; gives no additional information: 

Proposition 5.3 Suppose ip G RV_ Q , a > 0. Then A; = 1. 

Proof. Change the variable to x := ip~ 1 (q) i n dS2J) and apply L'Hopital's rule: 

21ogg ,. 21ogV(x) ip(2x)ib'(2x) 
lim — r = lim — — f = hm 



q^O log C(q, q) x-J-oo log^(2x) x-^oo ip(x) \j}'{x) 

Now, 1/)' is monotone by ti-monotonicity, and ip(x) = — f°° rj)'(t)dt. Since a ^ 0, the 
Monotone Density Theorem implies that —ip' G RV_ a _i, and so the limit above is equal 
to 2- a /2- a - 1 = 2. By (^]) . then, X t = 1. □ 

In fact one can easily show that A; is always one whenever A; > (see Demarta [8J, 
Lemma 2.1 {Hi) for the corresponding property in the case of A u ). If C is Archimedean 



x This term has several different meanings depending on the context. Here we use it a vague sense to 
support the intuition about the coefficients of tail dependence. 
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with strict generator tfi, and ip is in the Gumbel domain of attraction (and hence in 
RV-oo, implying that A; = 0), we can be more precise about the behavior of A/. 

Proposition 5.4 Suppose C is a bivariate Archimedean copula whose generator ip is 
strict and in the Gumbel domain of attraction with auxiliary function a. If a £ RV/j, 
then 

Xi = 2-P - 1. 
Proof. As in the preceding proof we obtain 

lim 21ogg = i im ^'foOMs) 

q-¥0logC(q,q) x-*oc tj)'(2x)/il)(2x) ' 

By Theorem 13.51 and the subsequent remark, we have a{x) ~ —ip(x)/i/j'(x), whence the 
result follows due to the assumption on a and the definition of regular variation. □ 

These results are in line with the intention that A/ should allow to differentiate between 
different degrees of asymptotic independence (in the above vague sense). The conditions 
on t/j in Proposition 15.41 can immediately be translated into conditions on the radial part 
R using Proposition 14.81 

As mentioned above, there is a counterpart to Proposition 15.31 for the upper coefficient 
X u . More precisely, X u = 1 as soon as X u > 0. A sufficient condition for this to 
happen is, according to Corollary 15. 2 \ that the radial part R corresponding to ip satisfies 
l/R E RV_ Q with a E (0, 1). We have not been able to find a counterpart to Proposition 
15.41 i.e. to deduce what happens in the case a E [l,oo), and hence X u = 0. 

We conclude this chapter with a short discussion on the relationship between distri- 
butions, their copulas, survival copulas, and coefficients of tail dependence. The tail 
dependence coefficients are sometimes defined as a quantity relating to the distribution 
F of a random vector, but having the property that it only depends on the (ordinary 
or survival) copula of that distribution. That is how the concept was approached here. 
However, many authors define the tail dependence coefficients as quantities relating di- 
rectly to a particular copula C, regardless of whether this copula in a specific modeling 
context appears as the (ordinary) copula of a distribution under consideration, or as its 
survival copula. This latter convention has the consequence that if X u , say, is the upper 
tail dependence coefficient of C, and C is the (ordinary) copula of F, then X u describes 
the upper tail of F. If, on the other hand, C is the survival copula of F, then A n describes 
the lower tail of F. When dealing with Archimedean copulas, this can become confusing 
since these copulas appear as the survival copulas of £i-norm symmetric distributions. 
Hence the quantities Aj and Az treated in this chapter both describe the upper tail of 
the corresponding /i-norm symmetric distribution, while X u and A u describe its lower 
tail. 
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Chapter 6 



Extreme value limits of 
Archimedean copulas 

A d-dimensional copula C with the property that C n (u l l n ) = C(u) for all n > and 
u E [0, l] d is called extreme value (EV) copula. These copulas arise when one considers 
maxima (or, as we will see, minima) of multivariate distributions. It is well-known (see 
Genest and Rivest |15j ) that if C is Archimedean, it belongs to the Gumbel family, i.e. its 
generator is ip£ n (x) = exp(— x 1 ^) for some > 1. The language of £i-norm symmetric 
distributions makes it simple to understand why this is to be expected. This will be 
apparent in Section 16.11 where we consider the limiting distributions of componentwise 
minima of ^i-norm symmetric distributions. In Section [6.21 we consider the maxima of 
these distributions. Here the limit turns out to be the so-called Galambos copula (see 
PU and 0). 

6.1 Componentwise minima of £i-norm symmetric distri- 
butions 

Consider an i.i.d. sequence X\, X%, ... of M. d - valued random vectors. The corresponding 
componentwise n-block minima are defined as 

W n = (Wi,„, . . . , W d .n) := (min{X ljfc }£ =1 , . . . , mm{X d ^ =1 ) . 

In the sequel it will always be assumed that each X n = (Xx >n , . . . ,X<i t n) follows a d- 
dimensional ^i-norm symmetric distribution with radial part R ~ F, where R > 
a.s., and joint survival survival function H. By Theorem 12.121 it then follows that 
H(x) = ip(\\x\\i) for x G Ri, where ip = %BdF- Moreover, ip is then the generator 
of the survival copula C of X n , which is Archimedean. We now fix this notation and do 
not repeat it at each instance it appears in the sequel. 



42 



The ^i-norm symmetric distributions have the following closure property under the op- 
eration of forming minima: 

Lemma 6.1 The n-block minima W n are again l\-norm symmetric. The corresponding 
survival copula generators are ip n . 

Proof. For each n, W n is clearly confined to the interior of M.+ with probability one. 
Its survival function is 



{min{Xi jfc }£ =1 > xi} n • • • n {mm{X dtk }l =1 > x d }] 
f| f] {X itk > Xi } 



i=l,...A fc=l,...,n 



f| {X i>1>Xi } 



i=l,..,,d 

= r(\\x\h), 

where the third equality uses mutual independence of X±, . . . ,X n . The result now 
follows by Theorem [H2l □ 



Lemma 6.2 Suppose 1 — ij){x~ l ) 6 RV_i/g for some 9 < oo. Then there exists a 
sequence (c n ) of positive constants such that c^Wi^ =4> Y as n — >■ oo, where the survival 
function of Y is tpg . 

Proof. Using the identity max(— x) = — min(x) we get —W\ n = max{— Xi fc}? =1 . 
The random variables — Xi & are i.i.d. with distribution function if)(—x) for x < 0, which 
has right endpoint zero. Moreover, since 1 — ^(x -1 ) € RV-i/o by assumption, ip(—x) 
is in the Weibull domain of attraction by Theorem 13.51 By Proposition 13.61 there exist 
positive constants c n such that —c~ l W\ in => —Y ~ exp(— (— x) l l e ) = ipf u (—x). The 
result now follows upon noting that, for x > 0, 

F[Y >x}= F[-Y < -x] = ^ u (-(-x)), 

where we used the continuity of ip^ n . □ 



Before continuing we note that Proposition 14.121 implies that 1/9 £ [0,1]. Since the 
hypothesis of Lemma [6.21 excludes 9 = oo, we always have 9 £ [1, oo), which is consistent 
with the requirements of the Gumbel generator. 

We are now in a position to prove the next result on the limiting distribution of normal- 
ized n-block minima of £i-norm symmetric distributions. As a corollary we obtain the 
extreme value limits of Archimedean copulas. 

Proposition 6.3 Suppose 1 — ip(x _1 ) € RV_i/q for some 9 < oo. Then there exists a 
sequence (c n ) of positive constants such that c^Wn =>• Wq as n — > oo, where Wo is 
l\-norm symmetric with survival copula generator ip$ n - 
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Proof. For any constant a > 0, Lemma 16.11 implies that the survival function of 
a W n is ^"(^ll^lli)) with the one-dimensional marginals thus being ip n (ax). With 
c n as in Lemma 16.21 we therefore conclude that the functions x i— > ip n (c n x) converge 
weakly to ipf u as n — > oo. Hence ^{c^Wn > x] = ip n (c n \\x\\i) — > i/)g(\\x\\i) weakly as 
n — > oo, which means precisely that c^W n =>• V^o- Theorem 1 2 . 1 2 1 implies that Wq has 
the desired properties. □ 



Corollary 6.4 Suppose C is an Archimedean copula whose generator tp satisfies 1 — 
V>(z _1 ) £ -RVLi/e / or some 9 < oo. Then C n {u l / n ) — >■ Cf u (u) pointwise. 

Proof. Let the -2T n above follow the £i-norm symmetric distribution whose survival 
copula generator is ip. By Lemma 16.11 the survival copula of W n , and hence of aW n 
for any a > 0, is 

^ n (^- 1 (^ /n ) + . . . + v>"Vi /n )) = c^u 1 /"). 

Weak convergence of a sequence of joint distributions toward a limiting distribution 
implies pointwise convergence of the survival copulas toward the survival copula of the 
limiting distribution (see e.g. Feidt et al. |12|). By Proposition 16.31 the limiting distri- 
bution has survival copula Cg, with 9 as in the statement of the corollary. The result 
follows. □ 



In the case of £i-norm symmetric distributions, it is the survival copula that is Archimedean. 
This is the reason why we considered the minima of these distributions in order to arrive 
at the conclusion in Corollary 16. 41 However, in a situation where an Archimedean copula 
C is the copula (as opposed to the survival copula) of a (i-dimensional distribution Y 
under consideration, then C n {u l / n ) will be the copulas of the n-block maxima, defined 
as 

M n = (Mi >n , . . . , M djn ) := (max{yi,fc}fc=i, • • • , max{Y djfc }£ =1 ) , 

where Y n = (Y\ )n , . . . , Y^ >n ), n > 1, form a sequence of independent copies of Y. Indeed, 
if G is the joint distribution function of Y, independence of the M n implies (cf. the proof 
of Lemma I6.ip that 

F[M n <x} = (¥[Y < x]) n = G n {x). 

Hence the marginals are P[Mj in < Xj\ = G™(xi), and an explicit computation yields the 
form C Tl (it 1 / n ) of the copula of M n . Corollary 16.41 of course applies also in this case. To 
summarize, we have the following (as before, C is Archimedean with generator ip): 

(i) If C is the copula of the distribution Y, then the dependence structure of the 
corresponding n-block maxima can be approximated by the Gumbel copula for 
large n (if 1 — ^{x~ l ) £ RV_ a ). 

(ii) If C is the survival copula of the distribution Y, then the dependence structure 
of the corresponding n-block minima can be approximated by the Gumbel copula 
for large n (if 1 — ^(x _1 ) G RV_ Q ). 
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We finally remark that Proposition 14. 1 ll allows us to express the condition on ip in terms 
of a condition on the corresponding radial part R — namely, 1/R must have a regularly 
varying tail. 



6.2 Componentwise maxima of 4-norm symmetric distri- 
butions 

In this section, M n will denote the componentwise ra-block maxima of the i.i.d. ^i-norm 
symmetric random vectors X\, X2, ■ ■, i.e. 

M n = (Afi, nj . . . , M d>n ) : = (max{X ljfc }£ =1 , . . . , max{X djk }% =1 ) . 

As before, R ~ F with R > a.s. is the radial part of X n , and ip = 2Hrf-F is the 
corresponding Archimedean generator. The generated survival copula is denoted C. 
Unlike the minima W n considered in Section 16.11 the M n are not £i-norm symmetric. 
We thus need to take a different approach to analyze their asymptotic behavior, and 
it turns out that the theory of multivariate regular variation and multivariate extremes 
provides the proper framework. The stochastic representation of the £i-norm symmetric 
distributions renders the computations particularly simple. 

Since the multivariate extreme value theory is more complicated than its univariate 
counterpart, and because it is only used in this section, we refrain from giving an in- 
troduction to the general theory. Instead we give precise references for results that are 
used. The book |25j by Resnick contains all theory we need here, and much more beyond 
that. 

Proposition 6.5 Suppose X is l\-norm symmetric with radial part R G RV_ Q , a G 
(0,oo). Then X is in the domain of attraction of 

G(x) := exp {-/z([0, x] c )} , x > 0. 

where the exponent measure \i is given by 

H{x G M+\{0} I ||aj||i > r,aj/||a;||i G ^4} = r~ a a(A), r > 0,A G S d , 

and the spectral measure a is the uniform distribution on S d . 

Proof. Since X is ^i-norm symmetric, Proposition 12.91 implies that S := X/\\X\\i 
is uniformly distributed on S d , ||X||i = R, and \\X\\i and X/\\X\\ 1 are independent. 
Hence for any Borel subset A C S d and x > 0, 

n^oo P[||X||i>u] «~>oo F(u) 11 11 

Setting t := 1/F(u) and at '■= (1/F)^(t), we also have 

B>MBij = , ^-mi^iu > x, xnx h e a\. 

u^oo ir[\\X\\i>u\ t^tco 
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We can now apply Corollary 5.18 (b) in Resnick to reach the desired conclusion with 
ff = P[S £(■)]. ' □ 



This means in particular that X is multivariate regularly varying with tail index a 
and spectral measure a. We now express the exponent measure /i directly in terms of 
a. 

Lemma 6.6 Let /x and a be as in Proposition ] 6.5[ Then 



fi([0,x] 



max 



s d 



a(dw), 



x e 



Proof. We slightly modify the argument in Resnick [25], p. 267-268, to account for 
the fact that a need not be one. 

Define the polar coordinate transformation T : x i— > (||ac||i,ac/||x||i). Then by Propo- 
sition [631 A 4 ° T _1 ( r i^.) = r~ a a(A) for r > 0, A C 5 d , with density ar~ a ~ 1 dra(dw). 
Of course /i([0, a;] c ) = /i o T _1 o T([0, ;c] c ), and the same computations as in [25], p. 267 
yield 

T{[0,x} c ) = l{r,w) | r >min(^,...,^ 



We can now integrate the density of fi o T over this region: 



fi([0,x] 



ar a 1 dra(dw) 



T([0,x] c ) 

I 

S d J hr>mm(^-,...,^\\ 



ar 



-a-1 



dra(dw) 



mm 



r xi 


xd y 







a(dw) 



which is equal to the asserted expression. 



□ 



It is now possible to compute the copula Co of the limiting distribution G. Although 
it does not have a particularly elegant form (except in the bivariate case, as we shall 
see), it does have an important interpretation: if C is the copula of the distribution Y, 
then corresponding n-block minima have a dependence structure approximated by Co 
for large n. Equivalently, Co describes the dependence of the n-block maxima of Y in 
case C is the survival copula of Y. See the discussion at the end of Section f6.il 

First, to obtain the marginals G\ of G (they are all equal due to exchangeability) we 
note that by Proposition 16.51 G\(x) = exp{— fi(B x )}, where B x := [0, (x, oo, . . . ,oo)] c . 
Lemma 16.61 then gives 

(J,(B X ) = x~ a I Wio(dw) = c a x~ a , 
Js d 
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with c a = J sd wfa(dw). Hence G\{x) = exp{— c a x~ a }, which in particular means that 
G\ is of the same type as & a . This is to be expected since the marginal survival function 
of X is ip = yHdF, which is in MDA(<3? Q ) by Proposition 14.31 and Theorem 13.51 (we still 
work under the assumption of Proposition 16.51 R G RV_ Q with a £ (0, oo)). 

Inverting G\ gives G^{u) = (-c-Mog-u)" 1 /", and we may apply Lemma [6.6l to compute 
the copula of G as Cq{u) = G(G^ 1 (u\), . . . , GT/ 1 ^)): 

f / logui logu rf a \ \ 

/ max < w l , . . . , w d > a(dw) > 

Js d { c a c a J J 

— / max{— w" log ui, . . . , — log u^} o~(dw) 1 . 
c a Js d J 



Cq(u) = exp 



exp 



Although by Sklar's Theorem we already know that Co is a copula, one may readily 
verify that the marginals are standard uniform. It is also easy to check that Cq satisfies 
the EV copula property Cq (u l / n ) = Cq(u). 

Example 6.7 For d = 2, Co can be computed explicitly due to the simple form of the 
spectral measure a . The first step is to calculate V "(xi,^) := f s2 inax.{wiXi,W2X2}cr{dw) 
for x±,X2 > 0. We parameterize S 2 by t := w\ and obtain 



V(xi,x 2 ) 



f'l f'K f'l 

/ m&x{x 1 t a ,x 2 (l-t) a }dt= / x 2 (l-t) a dt+ Xl t a dt, 

JO JO Jk 



where k = x^ + x\ ) solves the equation x\K a = X2(l — K) a ■ The two integrals 

on the right side are straightforward to evaluate, and the result is 

V(xi,x 2 ) = l ]^ a {^i +x 2 - (xi 1/a + x 2 1/a ^j ^ • 

The quantity c a is obtained similarly: c a = J sd wfa(dw) = J t a dt = (1 + a)~ 1 . Piec- 
ing things together we get Cq(ui,U2) = exp { — (1 + a)V{— log u\, — log ^2)}, or, more 
explicitly, 

C ( Ul ,u 2 ) = ^ 2 exp {((-logu^ 1 ^ + (-logns)- 1 / )^! . 
This is nothing but the well-known Galambos copula. 

We conclude this section with a remark regarding the fact that we only considered the 
heavy-tailed case, i.e. R S RV_ Q for a £ (0,oo), or, equivalently, R being in the Frechet 
domain of attraction. What would happen if R were in the Gumbel domain? Assuming 
unbounded support for simplicity we have 

Hm p[| W |, > u X xnx h g a] = fM 

n^oo P[||^||l > u] F(u) 1 1 
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for all Borel sets A C The limiting measure /i in Proposition 16.51 would thus be 
identically zero, /z([0, £c] c ) = for x £ Rl\{0}, meaning that the limit distribution 
would be degenerate with a point mass at the origin. Although a different normalization 
(such as (• — u)/a(u) as in Theorem 13. 5p might still yield meaningful results, we do 
not pursue this here since it involves translations and thus is non-standard in the basic 
theory of multivariate regular variation and multivariate extremes. 
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Chapter 7 

Archimedean threshold copulas 



In the univariate theory of extremes, the problem of determining the limiting distri- 
butions of maxima (or minima) is intimately related to that of finding the limiting 
distributions of threshold exceedances. In the multivariate setting things become more 
complicated, since thresholds can have many different forms and diverge in many differ- 
ent ways. In this chapter we consider the limiting dependence structures of componen- 
twise exceedances when the underlying distribution has an Archimedean copula. The 
crucial tool will, as before, be the £i-norm symmetric distributions. 



7.1 Lower threshold copulas 

Let C be a d-dimensional copula and suppose (Ui, . . . , U d ) is distributed according to 
C. For given quantile levels a% £ (0, 1], 

F[U l </3 1 ,...,U d </3 d \U 1 <a 1 ,...,U d <a d ], fte[0,a 4 ], (7.1) 

defines a distribution function on [0, a{\ x ... x [0,aj. The copula of this distribution 
is called the lower threshold copula of C; the limit as each oti decreases is called the 
limiting lower threshold copula (LLTC). If C is Archimedean, the LLTC is always the 
Clayton copula, whose generator is tp^(x) = (1 + Ox)^ 1 ^ 9 . This result follows in a very 
transparent way using the connection with £i-norm symmetric distributions. 

We need to be more precise about the statement "each «j decreases". It can happen 
that the set Lq := {u E [0, l] d \ C(u) = 0}, called the zero set of C, has positive measure 
(w.r.t. Lebesgue measure on [0, l] rf .) If C is Archimedean, this happens when the gener- 
ator t/j is non-strict. In this case one cannot let cti J, 0, since eventually the probabilities 
in ()7.ip will not be well-defined. Instead, we should require a := (oci, . . . ,a d ) ^ Lq 
and ot — >■ uq for some uq G OLq. If Lq ^ {0} there are different possible choices for 
uq, and even when Lq = {0} there are no guarantees a priori that the LLTC is in- 
dependent of how a approaches uq = 0. To avoid ambiguities, one therefore usually 
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takes a.\ = a.2 = ■ ■ ■ = oca and lets 014 decrease until it hits 8Lq (this is the case in 
e.g. McNeil et al. |20| and Juri and Wiithrich |18|). Since in the Archimedean case, 
Lq = {u G [0, l] d I Yli=i i } ~ 1 { u i) > V' _1 (0)}, we are able to handle this complication 
by supposing that Yli=i ( Q «) t ^ _1 (0)i which ensures that a approaches Lq from 
above. Moreover, it turns out that neither the relative rates of convergence of the oti, 
nor which uq G Lq they tend to, affects the LLTC. We therefore allow the quantiles ctj 
to be different in order to emphasize this fact. 

Now, suppose C is Archimedean with generator ip. Since ip -1 is decreasing, {U\ < 
u] = {V> -1 (l7i) > ^~ X iu)} for u G [0,1]. With X { := "i := V* -1 ^) and 

:= ^ _1 (/3j) — ^~ 1 (aj) > 0, (|7.ip can thus be written as 

P[C/! < fa, . . . , U d < fii I Ui < at, . . . , U d < a d ] = 

= F[X 1 -v 1 >x 1 ,...,X d -v d >x d \X 1 >v 1 ,...,X 1 >v 1 } (7.2) 
=: H v (x), 

where x = (xi, . . . , x d ) with xi G [0, ^" 1 (0) — Vj\ and v = (vi, . . . , v d ) denotes the vector 
of thresholds. Since all Xi are continuous, the inequalities can be taken as strict, and 
we see that H v defines a survival function on the interior of MrL. 

Lemma 7.1 X = (Xi, . . . , X d ) as defined above is i\-norm symmetric with survival 
copula generator ip. Moreover, X v ~ H v is also l\-norm symmetric with survival copula 
generator tp v given by 

ip(x + |M|i) 



i> v (x) 



« 1 



Proof. The statement about X follows from Corollary 12.131 An explicit computation 
yields 

- P[Xl - vi > xx, . . . ,X d - v d > x d ] ip(\\x\\i + \\v\\i) 

H v (x) = — 



F[X 1 >v 1 ,...,X 1 >v 1 ] n\\v\\i) 
whence the last assertion due to Theorem 12.121 □ 



Due to (|7.2|) . the survival copula corresponding to H v , which we denote by C v , is the 
lower threshold copula of C. Now, a. L \. if and only if Vi f V' _1 (0), so in view of the 
discussion in the beginning of this section the LLTC of C is given by the limit of C v as 

IMIi t^ _1 (o)- 

Proposition 7.2 Let if) and X v be as in Lemma \ 7.1\ and suppose if) is in some do- 
main of attraction. Then there exists a normalization /?(•) such that X v / f3(\\v\\i) con- 
verges weakly to an l\-norm symmetric distribution with survival copula generator if)g 
as ||u||i — > ^ _1 (0). The parameter 9 is given by: 

(i) If ip is in the Frechet domain, i.e. ip(x) G RV_ a , for some a G (0, 00), then 
6 = 1/a. 
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(ii) If tjj is in the Gumbel domain, then 9 = 0. 

(Hi) If tp is in the Weibull domain, i.e. tp(x* — x^ 1 ) £ RV_ a for some a £ (0, oo) and 
where x* = ^ _1 (0), then 9 = —1/a. 

Proof. In the Frechet case Lemma 17.11 and the fact that ij) £ RV_ a yield 



\\ v \\i 



> x 



^(H|i(l + INi)) 



(l + N|i)^ = ^%(IMIi) 



v i 



as ||w||i -> ^ _1 (0) = oo. In this case we may thus choose j3{x) = x. In the Weibull case, 
let t be defined via ||i>||i = x* — t , and set f3(x) := x* — x. Then 



> x 



if>(\\v\\i + 



V l) X i 



v \\l) 



i/j(x* 



\xh )t 



ip(x* — t 1 ) 



-+ (1- 11*110° =^ 1 /o(ll*lli), 



v l 



x 



since i/j(x* — x^ 1 ) £ RV_ Q . In the Gumbel case we take j3 to be the auxiliary function 
of ijj. The right side of the first line in the previous display now tends to exp{— ||sc||i} = 
tpQ l {\\x\\i) due to the property (|3.2|) in Theorem 13.51 In each of the three cases, the 
assertion that the limit is ^i-norm symmetric follows from Theorem 12.121 □ 



Remark. The function /?(•) has the standard interpretation as the proper normalization 
for the conditional univariate exceedances {X\ —v\X\ > v) to converge to a Generalized 
Pareto Distribution (cf. Balkema and de Haan [1] and Pickands |23j). 

Note that the parameter 9 can be negative. This happens when a := ^ _1 (0) < oo and 
1 — ip(a — x^ 1 ) € RV- a , a > 0. From Proposition 14.91 we know that one has the limitation 
9 = —1/a > — l/(d— 1), which is consistent with the range of parameter values for which 
the Clayton copula is defined (see McNeil and Neslehova [19J, Example 4.) 

Corollary 7.3 Let C be an Archimedean copula whose generator if) is in some domain 
of attraction. The corresponding limiting lower threshold copula (LLTC) is then Clayton 
with parameter 9 as in Proposition \7.2\ 

Proof. Denoting again the copula if H v by C v , the LLTC is given by the pointwise 
limit of C v as \\v\\i — > ^ _1 (0). But scaling by a positive constant does not change the 
copuls0i so X w //8(||t>[|i), like X v , has survival copula C v . Since weak convergence of 
the distributions implies pointwise convergence of the copulas (cf. Feidt et al. p2]), the 
assertion follows from Proposition 17.21 □ 

1 In fact, copulas are invariant under strictly increasing transformations Ti of the components of the 
underlying random vector — if (Y\,Y2) ~ H with marginals Hi, H2 has copula C, then the copula of 
(T 1 (Y 1 ),T 2 (Y 2 )) is H(Ti~ olio Ht('),Tf o T 2 o H^(-)) = C by the properties of generalized inverses. 
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The fact that only the sum ||v||i has to tend to ip 1 (0) immediately gives the form of 
all marginal LLTCs, which are defined as the limits of 

P[Bi < A, i € J | Ui < an, i e J], A e [0, a*], (7.3) 

as (ai)igj tends to the corresponding zero set, where J C {l,...,d} is a non-empty 
index set. More precisely, we have 

Corollary 7.4 Let C be an Archimedean copula whose generator ip is in some domain 
of attraction. All corresponding marginal LLTCs are then Clayton with parameter 9 as 
in Proposition \ 7. S\ 

Proof. (|7.3p is obtained from (|7.ip upon setting Oj = 1 and A = 1 for all i ^ J. 
This corresponds to a;, = «j = 0, i ^ J. Letting the remaining Vi increase such that 
IMIi t "0 _1 (O)) the assertion is inferred from Proposition [72] as in Corollarv l7.3l □ 

In a modeling situation, there might be given requirements on the behavior of the LLTC 
of the copula C . If one is working with the stochastic representation in terms of ^i-norm 
symmetric distributions, the results in Chapter 0] translate the three conditions on tj) 
given in Proposition 17.21 into conditions on the corresponding radial part R. It may also 
be the case that a certain R has been chosen through some inference procedure, and one 
then wants to know the implications for the extremal behavior of the limiting copula. 
Again, Chapter H] provides the necessary link. 

We conclude with a remark on the connection between threshold exceedances and the 
componentwise maxima treated in Section [672J One may view the conclusion of Example 
16. 71 as an extension of the well-known fact that the Galambos copula is the EV limit of the 
survival copula of the bivariate Pareto distribution (see McNeil et al. |20j . Example 7.50). 
In fact, the (symmetric) bivariate Pareto is £i-norm symmetric and its survival copula is 
Clayton (this is easily checked: its joint survival function is H(xx, x%) = [x\ + x<i + l)~ a 
with a > 0). 

Now, by Proposition 17.21 any heavy-tailed bivariate £i-norm symmetric distribution has 
asymptotically the same shape as the bivariate Pareto far out in the tail. In the case 
of £i-norm symmetric distributions, therefore, the following recipe produces correct re- 
sults: given X = (X\,X2), compute the limiting distribution X^ of the threshold 
exceedances. Take the componentwise maxima of X^ and compute the corresponding 
limiting distribution. The copula of this limit is the same as the copula of the distribu- 
tion that is obtained by directly taking the limit of the componentwise maxima of X. 
Intuitively such a result is reasonable; for large n, the n-block maxima will be located 
far out in the tail, where the underlying distribution looks approximately like the limit 
of the exceedances. 

The last statement gives rise to an interesting question: in general, what are the condi- 
tions on the underlying multivariate distribution for these heuristics to be correct? I.e., 
under which circumstances does the recipe described above give correct results? 
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7.2 Upper threshold copulas 



In analogy with the lower threshold copulas, one may define the upper threshold copula 
for given quantiles ai, ■ ■ ■ , a d as the copula of the distribution 

V[Ui > Pi,...,U d > p d \ Ux >a u ...,U d >a d ], fte[a„l], (7.4) 

where again (Ui, . . . , U d ) are distributed according to the copula C. The limit as the 
ai "f 1 is called the limiting upper threshold copula (LUTCjE and our aim is to investigate 
the extent to which our previous methodology can be applied to compute the LUTC in 
the case when C is Archimedean. 

We first define := ^ (f7j), Vi := ip~ 1 (a{) and x% := The quantity (j7.4j) can 

then be expressed in terms of the £i-norm symmetric random variable X = (X\, . . . , X d ), 
whose survival copula is C (that X is £i-norm symmetric follows as before from Corollary 
ET3J: 

F[Ui > fa, . . . , U d > (3 d | U x > ai, . . . , U d > a d ] = 
= F[X < x | X < v] 

= : H v (x), Xi S [0,vi\. 

To get the inequalities right we used strict monotonicity of and the fact that X 
has continuous marginals. It is straightforward to show that H v is not the distribution 
function of a £i-norm symmetric distribution — indeed, the support of (X \ X < v) is 
[0, v], which is not consistent with ^i-norm symmetry. Its survival copula is therefore not 
Archimedean, despite the fact that C is. Hence the methodology used in the LLTC case 
cannot be employed, and this difficulty cannot be overcome using basic multivariate 
extreme value theory as easily as in the EV copula case in Section 16.21 In fact, the 
ULTC cannot be explicitly computed, but we shall derive a distribution function whose 
survival copula is the ULTC of C. For simplicity we carry out the calculations in two 
dimensions to illustrate the procedure. Generalizing the results to higher dimension is 
straightforward. 

With d = 2, we have 

F[X 1 <x 1 ,X 2 <x 2 ] = l-F[X 1 >x 1 ]-F[X 1 >x 1 ]+F[X 1 >x 1 ,X 2 >x 2 ] = 
= 1 — iff(x{) — i>{x%) + ip(xi + x 2 ) = 
= G(x 1 ) + G(x 2 )-G(x 1 +x 2 ), 

where G(x) := 1 — ip{x) is the marginal distribution function of X. In the general case 
one may use the sieve formula to express F[X < x] in terms of G(^2 ieJ Xi) for subsets 
J C {1, . . . , d}. Proceeding with the case d = 2 we obtain 

_ G(x 1 ) + G(x 2 )-G(x 1 +x 2 ) 



2 We do not have the same problem as in the LLTC case — the set L\ :— {u £ [0, l] d | C(u) = 1} is 
always a nullset. Indeed, C(«i, . . . , Ud) < C(«i, 1, . . . , 1) = U\ implies that C(u) < 1 for u ^ 1. 
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Now, on f 1 is equivalent to Vi \. 0, so in order to have a non-degenerate limit we 
define X = (X\,X 2 ) as the normalized [0, l] 2 -valued random variable (depending on v) 
distributed according to 

P[Xi < xi,X 2 < x 2 ] = H v (x 1 v 1 ,x 2 v 2 ), Xi G [0,1]. 

The copula is not affected, since it is invariant under strictly increasing transformations 
of the components. We may therefore consider the copula of X as v\ and v 2 tend to 
zero. 

Proposition 7.5 Suppose 1/X\ ~ ^(x -1 ) is in some domain of attraction, and let 
vi = v 2 = v. In particular we then have 1 — ip{x~ l ) G RV- a for some a G (0, 1], and if 
in addition a < \, then 

Ho{xx,x 2 ) := limPLY! < x u X 2 < x 2 ] - X? + ^ ~ + 



v^o 2-2° 

Proof. Theorem 13.51 implies that G(x~ 1 ) = 1 — ^(x -1 ) G ^^(O) for some a G (0, oo] 
and by Proposition 14.12] a < 1. Since v\ = v 2 = v this yields 

Fry < ^ Y<^\ G(x x v) + G(x 2 v) - G( Xl v + x 2 v) 

nxi< Xl ,x 2 <x 2 ] - 2G{v) _ G[2v) 

G'(xid) . G(x2v) _ G(v(xi+X2)) 

G(v) G(v) G(v) 

~ 9 _ G(2v) 

Z G(v) 

^ xf + x% - {x\ + x 2 ) a 

□ 



The assumption v\ = v 2 is not restrictive if we are only interested in the copulas. Indeed, 
if v 2 /v\ — > c > 0, then one can show, using that the convergence in Proposition 17.51 is 
locally uniform (a basic result from Karamata theory), that Hq(xi,cx 2 ) is obtained in 
the limit. This just represents a linear scaling of the second component, which leaves the 
copula unchanged. The marginals Hq(x, 1) = (x a + 1 — (x + l) a )/(2 — 2 a ) do not have 
closed form inverses, so we are not able to explicitly compute the copula or the survival 
copula. As a final remark we mention that more precise results can be obtained in the 
limiting case when a = 1, see Charpentier and Segers [6]. 
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